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FIELD OF THE INVENTION 

The present invention relates to an isolated polypeptide having protease activity and 
5 being homologous to Nocardiopsis proteases, .as well as Isolated nucleic acid sequences 
encoding it. The invention furthermore relates to nucleic acid constructs, vectors, and host 
cells, including transgenic plants and non-human animals, comprising these nucleic acid 
sequences, as well as methods for producing and using the protease, in particular within 
animal feed, tor example m fish, feed, 
10 The proteose of the invention is thermostable, and characteristic structural features of 

relevance for the thermostability of proteases of peptidase family S2A or Si E are disclosed, 

The protease of the invention furthermore effectively degrades the soybean Bowrnaa* 
Blrk Inhibitor, as well as other anti-nutritional factors such as soybean agglutinin, and the 
Kunffe trypsin inhibitor, end also Isolated soy storage proteins such as, glyclnin and beta- 
18 conglycinln, 

BACKGROUND OF THE INVENTION 

Proteases derived from Nocardiopste sp. NRRL 18282 and Nocardiopsis dmmmiM 
NRRL 18133 are dlsciosed in WO 88/0394?. The DNA and amino acid sequences of the 
m protease derived from NocaMopm sp. NRRL 18282 are shown in DK application no. 1998 
00013. WO 01/58278 discloses the use in animal feed of acid-stable proteases related to the 
protease derived from ■Noc®rdk}psf& sp. NRRt 18282, as well as a protease derived from 
Nmanihpsk alba DSM 14010. These proteases, however, are not thermostable, 

JP 2-2S5Q81-A discloses a protease derived from NoaardIpp$i$ sp. strain OPC-210 
as (PERM P-10508), however without sequence Information. The strain is no longer available, 
as the deposit was withdrawn. 

DO 200432P discloses a proteolytic preparation derived from Nmardfopm 
mmpmm strain ZIMET 48847, however without sequence Information. The strain appears 
to be no longer available, 
30 JP 2003284571 -A, published after the first filing date of the present invention, 

discloses the amino acid sequence and the corresponding DMA sequence of a protease 
derived from Nocanffopzis sp. TQA-1 (FERiVt F-IBWB). The sequence has been entered in 
GENESEQP with no. ADF43584. 

Thermostable proteases are described In the poor art, for example a protease from 
36 Tfrnfmrniomspom fmm YX Is described by Lao and Wilson in AppL Environ, Microbiol. 
82:4258-4289 (1998), and the sequence Was filed in the public databases as 
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sptrambi„088984. However, this protease Is not : -homologous to hkmmWops® proteases, as 
the percentage Identity to the proteases of the invention is below 60%, 

it Is an object of the present fnyention to provide thermostable proteases that are 
homologous to Nocaftftopsis proteases, in particular with a potential for use in animal feed 
5 and/or detergents. 

SUMMARY OF THE INVENTION 

A number of thermostable pmfeases waha Isolated and characterized, viz. a protease 
derived from ftfoc&offbffs/s dassenvillet s ubsn. dassonvlltai D Sfvl 43235 (see SEQ ID NOs: 1 

10 and 2); a synthetic Protease 2 2 (see SEQ ID NOs: 7 and 8); a protease L2a derived from 
NoG&rtliopm sp. DSM 18424 (see SEQ ID NOs; 9 and 10); and Protease -8 derived from 
Nocardbpsis alba DSfv! 1864? (see SEQ ID NOs: 11 and 12), 

In a flu! aspect, the Invention relates to an Isolated polypeptide having protease 
activity, and having a malting temperature (T«,) of at least 78*0, as determined by Differential 

is Scanning Calorfmetry (DSC) in a 10 mM sodium phosphate, 50 mU sodium chioride boffer, 
pH 7.0, using a constant scan rate of l.S^O/min, wherein the polypeptide is selected from the 
group consisting of; (a) a polypeptide having an amino acid sequence which has a degree of 
identity to amino acids 1 to 1 88 of SEQ ID NO: 2, 1 to 1 98 of SEQ ID NO: 8, 1 to 1 80 of SEQ 
ID NO: 10 s and/or 1 to 188 of SEQ ID mi 12 of at least 80%; {b} a polypeptide which Is 

SO encoded by a nucleic add sequence which hybridizes under low stringency conditions With 
either of nucleotides 4S^108£ of SEQ ID NO: 1, 577-1184 of SEQ ID NO:; 7, 588-1152 of 
SEQ ID NO: 9, and/or 502-1085 of SEQ 10 NO: 11: and (o) a polypeptide which is encoded 
by a nucleic acid sequence which has a degree of Identity to any one of nucleotides 400- 
1082 of SEQ ID NO; 1, 577-1164 of SEQ ID NO; % 886-1152 of SEQ ID NO: 9. and/or 502- 

28 10SS of SEQ ID NO: 11 of at least 80%. The invention also relates to Isolated nucleic acid 
sequences encoding such proteases; nucleic aeid constructs, vectors, and host ceils 
comprising the nucleic acid sequences; as well as methods for producing and using the 
proteases,. In particular within animal feed. 

in a second aspect, the Invention relates to: 

30 A. An Isolated polypeptide of peptidase family S2A and/or peptidase family SI E having 
protease activity, and havihg an amino sequence comprising at least one of the following 
amino acids at the position Indicated: 10Y f 24S S 38T, 42G t 49T, SIT, S3Q, 54N, 82S S 88Q-, 
87S, 89T, 91T, 92S, 96A, 99A, 118N, 120T, 122a 12SQ, 129Y, 130S, 131 L, 135N, 147F, 
151S, 185S, 168V, 171V, 1791, and/or 1881; preferably 1QY* 38T, 828, 99A, 118N, 1.2GT, 

35 122R, 125Q, 129Y, 180S, 13ft, 185S, and/or 171 Y; mora preferably together with at least 
one of 95P, 100V, and/or 1141; and/or together with £B35 * D81 * S143): wherein each 
position corresponds to a position of SEQ ID NO: 2; 



a The polypeptide of 8 which comprises at least one of the f ofiowing amino acids at the 
position indicated; 3ST, 92S, 1201,1250, 131 L, 13Sa 147F, ISIS, 16SS, and/or 171 Y; 

C, The polypeptide of A whlc^ comprises at least one of the following amino acids at the 
position indicated: 1DY, 24S, 42G, 49T, SIT, 53Q, S4N ; 82S, 880, 8?S, 69T, 91T S 98A, 99A, 

6 1 18N, 122R, 129Y, 1308, 166V, 1791, and/or 1881; 

D. An isolated polypeptide of peptidase family S2A andfer peptidase family 81 E having 
protease activity, and having an amino sequence comprising at least one of the fallowing 
amino acids at the position indicated: 258, 38T, 42^ 44S, 49Q, 54R, 62S, 60S, 918, 928, 
95A, 99Q, 1001, 114V, 12ST, 1250, 129Q, 1311, 135N, 147F, 1518, 16SS, 156F, 171 Y, 

10 17SN, 1791, 1808, 184L, and/or 185T; preferably 2SS, 38T, 42P, 44S, S4R, 82S, 1260, 
1311, 16SS, 171Y, 17SN, 170t s 180S, 184L, ahd/oY 185T; more preferably together with at 
least one of 24A, 51V, 53E, 86A, 87T, 981, and/or 186L; aMior together with (H35 + D61 * 
S143): wherein each position corresponds to a position of SEQ ID NO: 12, 

E. The polypeptide of D which comprises at least one of the following amino acids at the 
15 position indicated; 38T, 928, 120T, 125Q, 131L, 13SN, 14?F, 151S, 1858, and/or 171Y, 

F, The polypeptide of D which comprises at feast one of the following amino adds at the 
position indicated: 25S, 42P, 448, 49Q, 54R, 62S, 80S, 91 S, 95A, 99Q, 1001, 114V, 12SQ, 
188F, 176N, 179L, 160$, 184L, and/or 185T. 

G, The polypeptide of any one of A, B, C, p, E, and F which has a Tro of aifeast 78°C 
20 as measured by DSC So 10ml4 sodium phosphate, SO mM sodium, chloride, pH 7.0; and/or a 

relati ve acts vlty M pH9 and 80°Cof at least 0.40; 

H. The- polypeptide of any one of A, B, C, P, E, F, and O which has a percentage of 
identity of at least 60% to either of amino acids -166 to 188, preferably 1-188, of SEQ ID NO: 
2; amino acids -192 to 196, preferably 1-198, of SEQ ID NO: 8; amino acids -195 to 189, 

25 preferably 1-188 Of SSQ ID NO; 10; and/or to amino acids -187 to -1, preferably MS8, of 
SEQ ID NO; 12; 

i. The polypeptide of any one of A, B, C« O, E, F, G; and H which is 1} a bacterial 
protease; is) a protease of the phylum Actmobaci&m lii) of the class Aciinohacteria: iv) of the 
order Aciiaomymial&s v) of the family Noemdi&psamae; vi) of the genus hkmrdiopm; 

30 and/or a protease derived from vli) Nocardiopsia species such as Nooardiopm a&t, 
Noc&'rdbpsf& aikaliphm, N&oardmpms antarstmm Noaardiopsm jmsjm, NOGardiopsis 
composts, Nocardhpsis exha/ans, Nocwdiopsis hslopbifa, Nocardlopm ha/ofoterans.. 
Nocardtopsis kumanmm, Nopatdidpsb mm Noowdhps® !ucsntm$k>, Nocardiopsis 
meiaiilws, Nocardiopms symematammana, Nocamiopsk tmhalosL Nocardbpsis tmpha, 

m Nomrdiopsis imvdisoholae, NQcmjb^ : Xlnjtefig$n$fe i w Haeanthprn dassonviilei e,g. a 
protease derived from NoGardmpsm antam0m or ^caroVopsfs dsssonvM, for example 
Nocaniiopsis dassonvms DSM 43235, mcardiopsis sp, DSM 18424, or Nacatrjfapsis alba 



OBU -15647, such as a polypeptide with the amino acid sequence of the mature peptide parts 
of either of SEQ ID NOs; 2, 8, 10, or 12; 

j. An Isolated nucleic add sequence comprising a nucleic add sequence which 
encodes the polypeptide of any one of A, % £, D, E, F, G< H or !; 
5 K. A nucleic acid construct comprising the nucleic add sequence of J operably linked to 
one or more control sequences that direct the production of the polypeptide In a suitable 
expression host; 

L A recombinant expression vector comprising the nucleic acid construct of K; 
M . A recombinant host cell comprising the nucleic acid construct of K or the vector of t; 
ION, A method for producing a polypeptide of any one of A, B, C, D, £, F, S, H, or l s the 
method comprising: (a) cultivating a. recombinant host ceil of M to produce a supernatant 
comprising the polypeptide; and (e) recovering the polypeptide; 

O. A transgenic plant, or plant part., capable of expressing the polypeptide of any one of 

A, 8, C, D s E, P> G, H, or I; 
16 P. A transgenic, non-human animal, or products, or elements thereof, being capable of 

expressing the polypeptide of any one of A, 8, C, D, E, F f G, H, or I; 

Q. Use of at least one polypeptide as defined in A, 8, C, D, E, F, G, H, or I (i). in animal 

feed; (IS) iri the preparation of a composition for use in animal feed; (iii) for improving the 

nutritional value of an animal feed; (iv) for increasing digestible and/or soluble protein in 
20 animal diets; fv) for increasing the degree of hydrolysis of proteins in animal diets; and/or (vfj 

for the treatment of proteins; 

R An animal feed additive comprising at least one polypeptide as defined in any one of 
A, 8, C, D, E, F, G, H or I; end (a) at least one fahsolubie vitamin, and/or (b) at least one 
wator-spluhie vitamin, and/or (c) at least one trace mineral; 
25 8. An animal feed composition, preferably a fish feed, having a crude protein content of 
m to 800 g/kg and comprising at least one polypeptide as defined In any one of A, 8, G, 0, 
Ei F, G, H, or I; or at least one feed additive of R; 

T. A composition comprising at least one polypeptide as defined in any am of claims A, 
S, C, D, E, F, G, H, or 1, together with at least one other enzyme selected from amongst 

30 aipha-amyiase (EC 3.2.11), phytase (EC 3.1,3.8 or 3.1.3.26); xyianase (EC 3.2.1.8); 
galaetanase (EC 3.2.1.89); alpha-gaiactosidaae (EC 3.2.1.22); protease (EC 3A-.~), 
phospholipase At (W 3.1,132); phospholipase A2 (EC 3,11,4); iysophospholipase (EC 
3.1.15):: phosphatase C (3,1.4.3); phospholipase D (EC 3.1.4.4); and/or beia-giucanase 
(EC 3.2.14 or EC 3.2.1,6); as well as 

35 U. Use of at least one polypeptide as: defined in any one of A, B, C, D s E, F, G, H or I in 
detergents, 

In a third aspect, the invention relates to: 
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l An isolated polypeptide having protease activity, which after incubation for four hours 
at 37 S C and pH 8,5 has degraded at least 38% (preferably at least 40%, 45%, 50%, 55%, 
80%, 65%, 70%, 75%, 80%, or at least 81%} of the soybean Bowniau-BIrk inhibitor, said 
percentage degradation being determined as 100% minus the percentage Intensity of the 
S intact Bawman-Birk inhibitor band on a stained SDB-F&GE (Trfs-Glycin© 4-20%) gel after the 
incubation, relative to the intensity of the same band before the incubation, said intensities of 
said bands being determined by scanning the gel; Wherein the polypeptide is selected from 
the group consisting of: (a) a polypeptide having an amino acid sequence which has a 
degree of identity to amino acids 1 to 138 of SEQ IP MO: 2, 1 to 198 of SEQ ID NO; 8, 1 to 

10 189 of SEQ ID NO; 10, and/or 1 to 188 of SEQ ID NO: 12 of at least 60%; (b) a polypeptide 
which Is encoded by a nucleic acid sequence which hybridizes under low stringency 
conditions with either of nucleotides 499-1062 of SEQ ID NO: 1, 577-1164 of SEQ ID NO; 7, 
886-1 1 52 of SEQ ID NO: 9, and/or 502-1 06S of SEQ ID HO: 1 1 ; and (c) a polypeptide which 
is encoded by a nucleic acid sequence which has a degree of identify to any one of 

IS nucleotides 499-1082 of SEQ ID NO; 1, 577-1164 of SEQ ID NO; 7, 588-1152 of SEQ ID 
NO; 6, and/or 602-1 OSS of SEQ ID HO: 11 of at least 60%; 

It. The polypeptide of I, wherein the Incubation takes place in an incubation buffer 
containing 60 mU dimethyl glutaric acid, 150 mM Nad, 1 mU CaCi 2f 0,01% Triton X-1'00, pH 
6.5; 

30 lih The polypeptide of any one of I, or II, wherein, during the incubation, the ratio 
between the protease polypeptide and the Bowman-Birk inhibitor is 1:10, based da A^; 

IV. The polypeptide of any one of I, U, or ill, wherein the gel Is stained with Ceomassle 
Brilliant Blue; 

V. The polypeptide of any one of I, it. III, or IV, wherein the Bowman-Birk inhibitor Is 
25 Sigma T-9777; 

VI. The polypeptide of any one of I, II, ill IV, or V, being further capable of degrading at 
least one of the following purified soy proteins: Soybean agglutinin (SBA), the Kunite trypsin 
Inhibitor, glycinin, and/of befa-conglycinin, using the principles as described In any one of i, 
il, III, IV, or V; 

3d VII The polypeptide of claim VI which has degraded at least one of the purified soy 
proteins to an extent of at least 10%, preferably at least 20%, 30%, 40%, 50%, 80%, 70%, 
80%, 90%, 92%, 94%, 98%, or at least 98%; 

VIII. An isolated nucleic add sequence comprising a nucleic acid sequence which 
encodes the polypeptide of any one of I, II, 111, IV, V, VI, or Vll; 
35 IX. An Isolated nucleic acid sequence .comprising a nucleic acid sequence which 
encodes a polype ptlde having protease activity* which polypeptide, after incubation for four 
hours at 37 $ C and pH 8,5, has degraded at least 38% (preferably el least 40%, 45%, 50%, 



m%, 60%, 65%, 70%, 75%, 80%, or at least 81%) of the soybean Bowman-Si* inhibitor, 
said percentage degradation being determined as 100% minus the percentage Intensity of 
the intact Bawman-Bifk inhibitor band on a stained SDS#AOE (Tris-Qiydne 4-20%) gel after 
the incubation, relative to the intensity of the same band before the incubation, said 
S intensities of said bands being determined by scanning the gei; wherein the nucleic acid 
sequence (a) hybridises under low stringency conditions with either of nucleotides 489-1 002 
of SEQ ID NO: 1, 577-1184 of SEQ ID NO: 7, 686 -1152 of SEQ ID NO; 9, and/or 502-1065 
of SEQ ID NO; 11; <b) has a degree of identity to any one of nucleotides 489-1082 of SEQ ID 
NO: 1577-1164 of SEQ iD NO: 7, 586-1152 of SEQ ID' NO: 8, and/or SG2-1G8S of SEQ ID 
10 NO; 1 1 of at least 60%: and/or (c) encodes a polypeptide which has a degree of identify to 
amino acids 1 to 188 of SEQ ID NO; 2, 1 to 188 of SEQ ID NO; 8, 1 to 169 of SEQ ID NO: 
10, and/or 1 to 188 of SEQ ID NO: 12 of at least 60%; 

X. A nucleic add construct comprising tie nucleic acfcf sequence of any one of VIU, or 
IX, operabiy linked to one or more control sequences that direct the production of the 
1 5 polypeptide In a suitable expression host- 
XL A recombinant expression vector comprising the nucleic: add construct of X; 
Ml A recombinant host ceil comprising the nnoielo add construct of X or the vector of XI; 

XIII. A method for producing a polypeptide of i, the method comprising; (a) cultivating a 
recombinant host ceil of Xli to produce a supernatant comprising the polypeptide; and (b) 

20 recovering the polypeptide; 

Xili. A transgenic plant, or plant pad, capable of expressing the polypeptide of I; 

XIV. A transgenic, #jW*a«iarr animal, of products, or elements thereof, being capable of 
expressing the polypeptide of I; 

XV. A method for producing a polypeptide of I, the method comprising (a) cultivating any 
25 one of the following stains: $ NocatrMopsm dasson* snbsp. mssonvifiw OQU 43235, (ih 

Nocardiopm sp. DSU 18424, or (iii) Nocstrdiopm alba DBU 15647; and (b) recovering the 
polypeptide; 

XVI. Use of at least one polypeptide as defined In I (I) in animal feed; 01) In the preparation 
of a composition for use in animal feed; {ill} for improving the nutritional value of an animal 

SO feed; (iv) for increasing digestible and/or soluble protein In animal diets; (v) tor Increasing: the 
degree of hydrolysis of proteins in animal diets; and/or (vt) for the treatment of proteins; 

XVII. An animal teed additive composing at least one poiypeptide as defined in I; and (a) at 
least one fat-soluble vitamin, and/or (b) at least one water-eoiuble vitamin, and/or (c) at least 
one trace mineral; 

35 iiXX. An animal feed composition having a crude protein content of 50 to 800 g/kg and 
comprising at least one polypeptide as defined in I, or at: least one feed additive of XVII: 
IXX, The feed composition of IIXX which Is a fish feed; 

6 



XX. A composition comprising at least one polypeptide as deflrssd in 1, together with at 
feast one oiler enzyme selected from amongst alphs-amyiase (EC 3.2.11), phytase (EC 
3.1,3.8 or 3.1.3.26):; xylanase (EC 3»2<lS); galaefanase (EC 3.2,1.89); aipha-galactosidase 
(EC 3,2,122); protease (EC 34.-.-X pHc^p^ip^se' At •{EC-&11,.32};- phospholipase A2 (EC 
§ 3,1.14); lysophospholipase. (EC 3.1.15); phosaholipase C (3,14.3); phosphMpase D (EC 
3.14.4); and/or heta-glucanase (EC 3,2,14 or EC 3,2.18); as well as 
•XXI. Use of at least one polypeptide as denned in \m detergents. 
In a fourth aspect, the invention rentes to; 

a. An isolated polypeptide having proteose activity, selected from the group consisting 
10 of: (a) a polypeptide having an amino add sequence which has a degree of identity (i) to 

amino adds 1 to 188 of SEC? ID NO: 2 of at least 88%, (II) to amino adds 1 to 196 of SEQ ID 
NO: 8 of at least 85%. (ill) to amino acids 1188 of SEQ ID NO: 10 of at least 85%. and/or a 
degree of identify to amino acids 1-188 of SEQ 10 NO: 12 of" at least 89% ; (b) a polypeptide 
which is encoded by a nucleic acid sequence which hybridizes under medium-high 

15 stringency conditions with (i) nucleotides 499-1082 of SEQ ID NO; 1 (II) nucleotides 577- 
1184 of SEQ ID NO: 7, (iii) nucleotides 588-1152 of SEQ ID NO: 9, (iv) nucleotides 502-1088 
of SEQ ID NO: 1 1, (v)a subsequence of any one of (l)-(iv) of at least 100 nucleoiides; and/or 
(vij a complementary strand of any one of (rMv); (c)a variant of the polypeptide having an 
amino acid sequence of (i) amino acids 1 to 188 of SEQ ID NO: 2, (si) amino acids 1 to 198 of 

20 SEQ ip .NO': 8, OH) amino adds 1-189 of SEQ ID NO: 10. or amino acids 1-188 of SEQ ID 
NO: 12, comprising 8 substitution, deletion, extension, and/or Insertion of one or more amino 
acids; (d) an allelic variant of (a), (b), or (o); and (e) & fragment of (a), p% (c) s or (&). that has 
protease activity; 

b. An isolated nucleic acid sequence comprising a nuoieic acid sequence which 

as (a) encodes the polypeptide of a; (b) encodes a polypeptide having protease activity, and 
which hybridlKds under medium-high stringency oondliions with (i) nucleotides 499-1082 of 
SEQ ID NO; 1 , (ii) nucleotides 577-1 184 of SEQ ID NO: 7, (Hi) nucleotides 586-1 152 of SEQ 
ID MO: % (iv) nucleotides 502-1065 of SEQ ID NO: 11 (v) a subsequence of any one of (i> 
(iv) of at feast 100 nucleotides; and/of (yl) a complementary strand of any one of (i)~(v); 

30 and/or (c) encodes a polypeptide having protease activity and which has a degree of identity 
(i) to nucleotides 499-1082 of SEQ ID NO: 1 of at least 86%, (Ii) to nucleotides 577-1 184 of 
SEQ ID NO: 7 of at least 85%, (Hi) to nucleotides 688-1 182 of SEQ 10 NO: 9 of at least 85%, 
(Iv) to nucleotides 502-1085 of SEQ ID NO: 11 of at least 89%; 

c. An isolated nucleic acid sequence produced by (a) hybridizing a DMA under medium- 
3S high stringency conditions with (s) nucleotides 4§S.10§2 of SEQ ID NO: 1 (Ii) nucleotides 

S77-1 184 of SEO ID NO: 7, p) nucleotides 588-1 iS2 of SEQ: ID NO; 9, (iv) nucleotides 502- 
1 085 of SEQ ID NO: 11 (v) a subsequence of any one of (ij-(iv) of at least 100 nucleotides; 



and/or (vf) a complementary strand of any one of and (b) Isolating the nucleic acid 
sequence; 

ti, A nucleic acid construct comprising the niioieSc sold sequence of any one of b f or c, 
operahly linked to one or more control sequences that direct the production of the 
5 polypeptide In a suitable expression best; 

&, A recombinant expression vector comprising fee puciefc acid construct of d; 

f. A recombinanl host cell comprising the nucleic acid construct of d or the vector of e: 

g, A method for producing a polypeptide of a s the method comprising; (a) cultivating a 
recombinant host cell of f to produce a supernatant comprising the polypeptide; and 03} 

10 recovering the polypeptide; 

k A transgenic plant, or plant part, capable of expressing the polypeptide of a; 

i. A transgenic, non-human animal, or products, or elements thereof j being capable of 

expressing the polypeptide of a; 

j, A method for producing a polypeptide of a, the method comprising (a) cultivating any 
15 one of the following strains; (i) Nocardkpsh dassotrMeJ mbsp. da$$mvi!lei DSP 43235, (si) 
Ncmntiops® sp, DSM 16424, or Nooartftopsis alba DSM 15847; and (h) recovering the 
polypeptide; 

Jr. Use of at least one polypeptide as defined In a (!) in animal feed; (II) in the preparation 
of a composition for use In anirnai feed; (ill) for Improving the nutritional value of an animal 
20 feed; {IV} for inaeasing digestible and/or soluble protein In animal diets; {v} for Increasing the 
degree of hydrolysis of proteins in anirnai diets; and/or (vi) for the treatment of proteins; 
I. An animal feed additive composing at least one polypeptide as defined In a; and (a} 
at least one fat-soluble vitamin, and/or (b) at least, one water-soluble vitamin, and/or (c) at 
least one trace mineral; 

25 m. An anirnai feed composition having a crude protein content of SO to 800 g/kg and 
comprising at least one polypeptide as defined in a, or at least one feed additive of ?; 
a. The feed composition of m which is a fish feed; 

o. A composition comprising at ieast one polypeptide as defined in a, together with at 
least one other enzyme selected from amongst aipha-arnyiase (EC 3.2.1,1), phytase (EC 

30 3, 1 .3.8 or 3. 1 .3.28); xyianase (EC 3.2,1 M); gaiactansse (EC 3.2.1 .38); aipha^alaelosidasa 
(EC 3,2.122); protease (EC 3.4,-.-}, piiosphollpase A1 (EC 3.1.1.32); phosphoiipase A2 (EC 
3.1.1.4}; lysophosphoiipase (EC 3.1.1.5); ptrasphollpase C (3.1.4.3); phosphatase D (EC 
3.1A4); and/or beta-giucanase (EC 3.2.1,4 or EG 3.2.1.6}; as well as 
p. Use of at least one polypeptide as defined in a In detergents, 

36 In a fjfth aspect, the invention relates to: an isolated polypeptide having protease 

activity, selected from the group consisting of; (a) a polypeptide having an amino add 
sequence which has a degree of identity to amino acids 1 to 133 of SEQ ID NO: 2, 1 to 188 
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of SEQ ID NO; 8, 1 Ho 1 96 of SEQ ID NO; 8» and/or to of at least 84%; (b) a polypeptide ha- 
ving an amino acid sequence which lias a degree of identity to amino acids -166 to 188 of 
SEQ ID NO: 2, and/or -iS2 to 188 of SEQ ID NO: % of at feast '75%; (e) a polypeptide which 
is encoded by a nucleic add sequence which hybridises under medium-high stringency 
5 conditions with (i) DNA encoding a protease obtainable from genomic DMA from 
M>card/opsls dassmwiBm subspmiss da«sonv?//a/ DSM -4323S by use of primers SEQ. ID 
NOs: 3 and 4; (IS) nucleotides 499-1062 of SEQ iD NO; 1, an&or 677-1 164 of SEQ ID HO: 7; 
Oil) nucleotides 1-1062 of SEQ ID HO; 1 S and/or 1-1164 of SEQ ID NO: ?; 0v) a subsequence 
of (I) or 01) or (is!) of at least 100 nucleotides; mvdim (vj-a complementgjy strand of (I), (it). (Hi) 
io or (tv); (d) a variant of the polypeptide having: an amino add sequence of amino adds 1 to 
188, or -166 to 188 of SEQ ID NO: 2, or amino adds 1 to 196, or -192 to 196 of SEQ SD NO: 
8, comprising a substitution, deletion, extension, and/or insertion of one or more amino acids; 
(e) an allele variant of (a), (b) or (c); and (f) a fragment of (a), (b), (c), (d) or (e) that has 
protease activity; 

IS An isolated polypeptide having protease activity, and having a melting temperature 

C4) of at least ?8 n C, as determined by Differential Scanning Calorlmetry (DSO) in a 10 nlM 
sodium phosphate, 50 mM sodium chloride buffer, pH 7 A using a constant scan rat© of 1.5 
0 D/min, wherein ihe polypeptide is selected from the group consisting of: (a) a polypeptide 
having an amino acid sequence which has a degree of identify to amino acids 1 to 188 of 

20 SEQ ID NO; 2, 1 to 188 of SEQ ID NO: 8, and/or 1 to 196 of SEQ ID NO: S, of at least 50%; 
(fo) a polypeptide having an amino acid sequence which has a degree of Identity to amino 
acids -166 to 188 of SEQ !D NO: 2, and/or -192 to 196 of SEQ SD NQ: 8, of at least 50%; (c) 
a polypeptide which is encoded by a nucleic acid sequence which hybridises under low 
stringency conditions with DNA encoding a protease obtainable from genomic DNA from 

26 Nocerdi'opsis a'assonviUes subspecies dassonviilei DSM 43235 by use of primers SEQ ID 
NOS. 3 and 4; nucleotides 499-1062 of SEQ ID NQ: 1, and/or 577-1164 of SEQ ID NO; 7; 
nucleotides 1-1062 of SEQ ID NO: 1, and/or 1-1164 of SEQ iD NO: 7; a subsequence of (I) 
or (ii) or p) of at least 100 nucleotides; and/or a complementary strand of 0), (si), (iii) or (iv); 

(d) a variant of ihe polypeptide having an amino add sequence of amino acids 1 to 188, or 
30 -166 to 188 of SEQ iD NO: 2, or amino acids 1 to 198, or -192 to 196 of SEQ ID NO: 8, 

comprising a substitution, deletion, extension, and/of insertion of one or more amino acids; 

(e) an allelic variant of (a),{b) or (g); and if) a fragment of (a), (b), (c), id) or (e) that has 
protease activity; 

An isolated nucleic add sequence composing a nucleic add sequence which (a) 
SS encodes the polypeptide of daim 1; (b) encodes a polypeptide having protease activity, and 
which hybridizes under medium-high stringency conditions with (i) DNA encoding a protease 
obtainable from genomic DNA from Nocm^>si$-,dsmcmM subspecies daseonw&i DSU 



43235 by use of primers SEQ. ID NOS. 3 and 4; fii> nucleotides 488-1082 or 1-1082 of SEQ 
ID NO; t, and/or 577-1 164 or 1-1164 of SEQ ID HQ- 7; (Iii) a subsequence of (i) or 00 of at 
(east 100 nucleotides; apd/or (Iv) a complementary strand of (i), 0!}, or (i); (e) encodes a 
polypeptide having protease activity ami which has a degree of tdentlty to nucleotides 499- 
8 1 082 SEQ ID NO: 1 , and/or §77-1 164 of SEQ ID HO: 7, of at least 85%; and/or (d) encodes 
a polypeptide having protease activity and which has a degree of identity to nucleotides 1- 
1062 SEQ SD NO: 1 and/or nucleotides 1-1 164 of SEQ ID' NO: 7 S of at least 81%; 

An isolated nucleic add sequence ccpipnstng a nucleic acid sequence which 
encodes a polypeptide having protease activity and a melting temperature (T w ) of at least 

10 78*C, as determined by Differential Scanning Caiorimefry (DSC) in a 10 rsM sodium 
phosphate, 50 mM sodium chloride buffer, pH 7,0, using a constant scan rate of 1.5 ;> G/min, 
wherein the nucleic acid sequence (a) encodes the above polypeptide; (h) hybridizes under 
low stringency conditions with DNA eroding a protease obtainable from genomic DMA from 
Noc&rdlopsis dassonviHes subspecies dassonviliei DSM 4323S by use of primers SEQ ID 

1 5 NO's: 3 and 4; nucleotides 498-1062 or 1-1062 of SEQ ID NO: 1 , and/or 577-1 164 or 1-1164 
of SEQ ID NO: 7; a subsequence of (i) or (it) of at least 100 nucleotides; and/or a 
complementary strand of (1), (8), or (iii); {«) has a. degree of identity to nucleotides 499-1062 
of SEQ ID NO; 1 , and/or 577-1 164 of SEQ ID NO: 7, of at feast 50%; and/or (d) has a degree 
of identity to nucleotides 1-1062 of SEQ ID NO; 1, and/or 1-1104 of SEQ ID NO; 7, of at least 

20 50%; 

An Isolated nucleic acid sequence produced by (a) hybridizing a DNA under medium- 
high stringency conditions with DNA encoding a protease obtainable from genomic DNA from 
Afecard/qps/s dassmvfflei subspecies dassoiwillei DSM 43235 by use of primers SEQ ID 
NOS, 3 and 4; nucleotides 499-1062 or 1-1062 of SEQ ID NO; 1 , and/or 577-1 164 or 1-1 164 
25 of SEQ ID NO: 7; a subsequence of (!) or (it) of at least 1 00 nucleotides; or a complementary 
strand of (i), (is) or {iii}; and (b) isolating the nucleic acid sequence; 

A nucleic acid construct comprising the above nucielc acid sequence operahiy linked 
to one or more control sequences that direct; the production of the polypeptide In a suitable 
expression host; 

30 A recombinant expression vector comprising the nucleic acid construct; 

A recombinant host ceil comprising the nucleic acid construct of the vector; 
A method for producing the above polypeptide, the method comprising: (a) cultivating 
a recombinant host cell of claim 8^p|^i^\a'##e^teftt..«»mprlsing the polypeptide; and 
(b) recovering the polypeptide; 
35 A transgenic plant, or plant part, capable of expressing .-the above polypeptide; 

A transgenic, non-human animal or products, or elements thereof being capable of 
expressing the above; polypeptide; 
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Use of at feast one of the above polypeptides (!) in animal feed; {%) In the preparation 
of a composition for use in animal feed; pi) for improving tie nutritional vaiue of an animal 
feed; (iv) for Increasing digestible ^^^^^.^ij^%.-arAm^'-0^i fv) for Increasing the 
degree of hydrolysis of proteins in animal diets; and/or (vi) for the treatment of proteins; 
5 An animal feed additive comprising at least one polypeptide as defined above: and 

(a) at least one fat-soluble vitamin, and/or {$} at least one wafer-soluble vitamin, and/or (e) at 
least one trace mineral; 

An animal feed composition having a emtio protein content of SO to 800 g/'kg and 
comprising at least one polypeptide as defined above, or at least one feed additive; 
10 A composition comprising at least one polypeptide as defined above, together with at 

least one other enzyme selected from amongst aipha-amylase {EC 3.2.1,1), phytase (EC 
3.1.3.8 or 3.1 3.26); xyianase (EC 3.2,1*8); galaetanase (EC 3.2.1,88); alpha-gaiaetosidase 
(EC 3.2,1.22); protease (EC 3.4.-,-), phosphatase A1 (EC 3.1.1.32): pnosphoiipase A2 (EC 
3,1,1.4); iysophospboiipase (EC 3.1.1.S); phosphoilpase C (3.1.4.3); phospholipase B (EC 
is 3,1 .4,4); and/or befa-giueanase (EC 3,2.1,4 or EC 3,2,1 ,8); 

Use of at least one polypeptide as defined above in detergents; 
An Isolated polypeptide of peptidase family 82A and/or peptidase family S1E haying 
protease activity, and having an amino sequence comprising at least one of Ihe fallowing 
amino adds at the position indicated: 10Y, 24S, SSI* 42G, 49T, SIT, 53Q:, 54U, 82S, 8853, 
20 B7B, S9T, 91T, 92S, 96A, 99A, 118N, 120T, 122R 125Q. 129Y, 130S, 131 L, 135N, 147F, 
1518, 185$, 168V, 171 Y, 1791, and/or 1861; preferably 10Y, 38T, 82S, 9SP, 89A, 100V, 1141, 
T18N, 120T, 122a 125Q, 129Y, 130S, 131L, tms t and/Or 171 Y; more preferably together 
with at least one of 95P, 100V, and/or 1141: and/or together with (H35 + D61 + S143); 
wherein each position corresponds to a position of SEQ ID NO: 2; 
25 The above family S2A or S1E polypeptide which comprises at least one of the 

following amino acids at the position indicated: 38T, 92S, 120T, 125Q, 1311, 135N, 147F, 
151S, 16SS, and/or 171Y; 

The above family S2A or S1E polypeptide which comprises at least one of the 
following amino acids at the position indicated: 10Y\ 243, 42% 49T. 51 T, 53Q, 54N, 82S, 
30 88Q, 878, S9T, 91T, 98A, 98A, 118N, 122R, 129Y, 130S, 188V, 1791, and/or 1881; 

The above family S2A or S1£ polypeptide which has a Trn of at least 7B"C as 
measured by DSC in 1GmM sodium phosphate, 50 mM sodium chloride, pH 7.0; and/or a 
relative activity at pH9 and 80 C C of at least 0.40; 

The above family S2A or S1E polypeptide which has a percentage of Identity to 
3S amino acids 1 -188, or -1 88 to 188, of SEQ IS NO; 2, and/or to amino acids 1 -1 98, or -1 92 to 
196, of SEQ ID NO; 8, of at least 50%; 
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The above family B2A or 81 E polypeptide which is f) a bacterial protease; H) a 
protease of tie phylum; ^e&no&aofew; i) ,df the class Actmobaciem; iv) of the order 
Actmsmymiates v) of the family Noearcftopmrnaei vi) of the genus Atocafifops/s; and/or a 
protease derived from vii) Nmardhpsla species such as Nm&rdiopsis alba, Nocanliopm 
s mtarutica, Noc&rdbpsh prnslna, Nomrdiopsis composts, Nocardtopsb axhalans, 
Nocatxtiopsis haiopftiia, Nocardbpsis hatoMemns. Nomrdlopsi&kumanemm, Nooartfiopsis 
fmieil, Nocardiopsis luceniemis, Namrdbps® meta$Gua> NQcara'iopsis aynmmataformam, 
Noci*rdi&psi$ tmhatost, Nocardtopm iropfca, NocardiopSfs omicHsohoteB, Nocardhpm 
xinjmngensis, or NQoaaJiopsis dassomftes, e.,g. a protease derived from Noc&txtiopsis 
10 anfawSca or Nocardiosis daseorn4M ; for example Nocardhpsis rHassonvM&i DSM 43235, 
such as a polypeptide with the amino add sequence of amino acids 1 to 188, or -168 to 188, 
ofSEQ ID NO: 2; 

An isolated nudeic acid sequence comprising a nucleic acid sequence which 
encodes the above family S2A or S1E polypeptide; 
15 A nucleic acid construct comprising the nucleic add sequence operahly linked to one 

or more control sequences thai direct the production of the polypeptide in a suitable 
expression host; 

A recombinant expression vector comprising the nucleic acid construct; 

A recombinant host ceil comprising the nucleic acid construct or the vector; 
20 A method for producing the above family S2A or S1E polypeptide* the method 

comprising; (a) cultivating a recombinant host ceil to produce a supernatant comprising the 
polypeptide; and (b) recovering the polypeptide; 

A transgenic plant, or plant part, capable of expressing the above family S2A or 81 E 
polypeptide; 

as A transgenic, non-human animal, or products, or elements thereof, being capable of 

expressing the above family S2A or S1g polypeptide; 

Use of at least one of the above S2A or S1E polypeptides (i) In animal feed; (ii) in the 

preparation of a composition for use in animal feed; {ill} for improving the nutritional value of 

an animal feed; (Iv) for increasing digestible andVor soluble protein in animal diets; (v) for 
SO Increasing the degree of hydrolysis of proteins in animal diets; and/or (vi) for the treatment of 

proteins; 

An animal feed additive comprising at least one one of the above S2A or S1E 
polypeptides; and (a) at least one M-aoiuble vitamin, and/or (b) at least one water-soluble 
vitamin, and/or (c) at least one trace mineral; 
35 An animal feed composition having a crude protein cont«*«i of 50 to 800 g/kg and 

comprising at least one one of foe above S2A or 81 E polypeptides, or at least one above 
feed additive; 
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A composition comprising at least one one of the above S2A or S1E polypeptides, 
together with at least one other enzyme selected from amongst aipha-amyiase (EC 3,2, 1.1), 
phyiase (EC 3.1,3.8 or 3,1.3.26); ^laoase (EC 3.2, 1 gaiadtanase (EC 3,3.189); alpha- 
galaoiosldase (EC 3.2,1.22); protease (EC 3A~.~) ? phosphollpase A1 (EC 3,1,1,32); 
§ phospholipase A2 (EC 3.1.1.4); iysophosphQllpasa (EC 3.1,1.8); phosphatase C (3,1,4.3); 
phospholipase D (EC 3,1.4,4); and/or beta-glueanasa (EC 3.2,1.4 or EC 3.2,1.6); as well as 
Use of at least on® one of the above S2A or S t E polypeptides in detergents, 
The above second, third, fourth, apd fifth aspects, are, independently of each other, 
also preferred sub-aspects of the -first aspect of the invention, as well as preferred sub - 
10 aspects of each other. 

DETAILED DESCRIPTION OF THE INVENTION 

Polypeptides having protease activity, or proteases, are sometimes also designated 
peptidases, proteinases, peptide hydrolases, or proteolytic enzymes. Proteases may fee of 
15 the exo-type that hydropses peptides starling at either end thereof, or of the endo-type that 
act internally In polypeptide chains (endop©ptid$ses). Endopeptidases show selivity on N- 
and Oterminally blocked peptide substrates that are relevant for the specificity of the 
protease in question. 

The term : "protease" Is defined iierein as an enzyme that hydrolyses peptide bonds. It 
20 includes any enzyme belonging to the EC 3.4 enzyme group (including each of the thirteen 

subclasses thereof), The EC number refers to Enzyme Nomenclature 1982 from NC-IUBMB, 

Academic Press, Sen -Diego, California,, including supplements 1-6 published In Eur, J. 

Biochem. 1994, 223, 1-5; Eur. J. Biochem. 1905, 232, 1-6; Eur. J. Biochem. 1998, 237, 1-5; 

Httr. d, Sioohem. 1997, 250, 1-6; and Eur. J. Blochem, 1999, 264, 810-860; respectively, the 
25 nomenclature is regularly supplemented and updated; see e.g. the World Wide Web (WWW) 

at htip://mmch^ 

Proteases are classified on the basis of their catalytic mechanism info the following 
groups: Serine proteases (S), Cysteine proteases (C), Aspartle proteases (A), Metallo 
proteases {M% and Unknown, or as yet unclassified, proteases (U), see Handbook of 
30 Proteolytic Enzymes, AJ.Sarrett, N.O.Rawlings, J,F.Woessner (eds), Academic Press 
(1998), in particular the general Introduction part 

In particular embodiments, the proteases of the invention and for use according to the 
Invention are selected from the group consisting of: 

(a) Proteases belonging to the EC 3,4.-,- enzyme group; 
35 (b) Serine pmfeases b#sng!h§ to the S group of the above Handbook; 

(e) Serine proteases of peptidase family S£&; and/or 
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(d) Serins proteases of peptidase family S1E as described In Bioehem.J. 
290:205-218 {1983) and in y£ROf% protease database, release 8.20, March 24, 2003, 
(VAW.roerops,aauk). The database is described in Bawlings, N.D., O'Brien, B, A & Barrett, 
A.J. (2002) MEROPSH the protease database. Nucleic Acids Res. 30,. 343-346, 
S For determining whether a given protease is a Serine protease, and a family S2A 

protease, reference is made to the above Handbook arid the principles indicated therein. 
Such determination can be carried out for ail types of proteases, be it naturally occurring or 
wild-type proteases; or genetically engineered or synthetic proteases. 

Protease activity can b© measured using any assay, in which a substrata is 

10 employed, that Includes peptide bonds relevant for the specificity of the protease In question. 
Assay-pH and assay-temperature are likewise to he adapted to the protease in question, 
Examples of assay-pH-vaiues are pH % 3, 4, 5, 6, 7, 8,9, 10, it, or 12. Examples of assay- 
temperatures are SO, 35, 3?, 40, 48, 50, 55, 60, 05* 70, 80, 00, or 95°C, 

Examples of protease substrates am casein, such as Azurina-Crosslinked Casein 

i s (AZCL-casein), Two protease assays are described in Example 2 herein, either of which can 
be used to determine protease activity, f or the purposes of this invention, the so-called pNA. 
Assay is a preferred assay. 

there are no limitations on the origin of the protease of the invention and/or for use 
according : to the invention. Thus, the term protease Includes not only natural or wild-h/pe 

£0 proteases obtained from microorganisms of any genus, but also any mutants, variants, 
fragments etc, thereof exhibiting protease activity, as well as synthetic proteases, such as 
shuffled proteases, and consensus proteases. Such genetically engineered proteases can be 
prepared as is generally known in the art, e.g. by Site-directed Mutagenesis, fey PGR (using 
a FOR fragment containing the desired mutate as one of the primers In the PGR reactions). 

25 or by Random Mutagenesis, The preparation of consensus proteins Is described in e.g. EP 
897085. Gene shuffling is generally described in e.g. WO 05/22825 and WO 06/00343. 
Recombination of protease genes can be made independently of the specific sequence of 
the parents by synthetic shuffling as described in Ness, J,£, et al, in Nature Biotechnology, 
Vol. 20 (12), pp. 1251-1255, 2002. Synthetic oligonucleotides degenerated in their DNA 

30 sequence to provide the possibility of all amino acids found in the set of parent, proteases are 
designed and the genes assembled according to the reference. The shuffling can be carried 
out for the full length sequence or for only part of the sequence and then later combined with 
the rest of the gene to give a full length seguence^ The proteases of the mature peptide parts 
of SEQ ID NOs: 2, 8, 10 and 12, such as Protease 22 (SEO ID NO: 8), and the protease 

35 derived from Nomnimpsls msmnm^-m^m^s,€^ssm^:UBM:02Z5 (SEQ ID NO: 2), 
are particular examples of such parent proteases which can be subjected to shuffling as 
described above, If desired together with the protease derived from Nocardiasis sp, HRRt 



18262, to provide additional proteases of the invention. The term "obtained from 8 as used 
herein in connection with a given source shall moan that the polypeptide encoded by the 
nucleic acid sequence is pfodpced by the source or by a cell in which the nucleic add 
sequence torn the source is present in a prefer embodiment, the polypeptide is secreted 
S exiraceliuiariy 

in a specific embodiment, the protease is a low-allergenjc variant; designed to invoke 
a reduced immunological response when exposed to animals, including man, The term 
immunological response Is to do understood as any reaction by the Immune system of an 
animal exposed to the protease. One type of immunological response is an allergic response 

10 leading to increased levels oflgE in the exposed animal, Low-aHergemc variants may be 
prepared using techniques known In the art. For example the protease may be corrugated 
with polymer moieties shielding portions or epitopes of the protease involved in m 
immunological response. Conjugation with polymers may involve in vitm chemical coupling of 
polymer to the protease, e.g. as described In WO 98/17929, WO 98/30682, WO 98/35026, 

15 and/or WO S9/0Q4S9, Conjugation may in addition or alternatively thereto involve in vivo 
coupling of polymers to the protease. Such conjugation may be achieved by genetic 
engineering of the nucleotide sequence encoding the protease, inserting consensus 
sequences encoding additional giycosylation sites in the protease and expressing the 
protease in a host capable of glycosylating the protease, see e,g, WO QQ/26304. another 

20 way of providing iow-allergenic variants Is genetic engineering of the nucleotide sequence 
encoding the protease so as to cause the protease to self-oiigomerfee, effecting that 
protease monomers may shield the epitopes of other protease monomers and thereby 
lowering the antigenicity of the oligomers. Such products and their preparation is described 
e g. in WO 98/16177. Epitopes involved in as immunological response may be identified by 

25 various methods such as the phage display method described in WO 00728230 and WO 
01/83559, or the random approach described in BP §81907. Once an epitope has been 
identified, its amino add sequence may be altered to produce altered immunological 
properties of the protease by known gene manipulation techniques such as site directed 
mutagenesis (see e.g, WO 00/28230, WO 00/26354- and/or WO 00/22103) and/or 

30 conjugation of a polymer may be done in sufficient proximity to the epitope for the polymer to 
shield the epitope. 

A polypeptide according to either aspect of the present Invention may comprise an 
amino acid sequence which has a degree of identity to me mature peptide part of either of 
SEQ ID NGs: 2, 8, 10, or 12, for example amino acids 1 to 188 of SEQ ID NO; 2, and/or to 
35 amino acids 1 to 1S6 of SEQ ID NO: 8 {the mature peptide parts), of, for example, at isast 
about 50%, and which have protease activity (hereinafter "homologous polypeptides"}- In 
particular embodiments, the degree of identity to either of the mature peptide parts of either 
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of SEQ ID NOs: 2 t 8, 10 or 12 is at feast about 62%, 64%, 66%, 88%, 70%, 72% : 74%, 76%, 
80%, 81%, 82%, 83%, 84%, 85%, 88%, 8?% 5 88% r 88% f 90%, 91%, 92%, 93%, 94%, 86%, 
96%, 97%, 98%, or at least 99%. in afternatlve embodiments, the degree of identity Is at 
least about 59%, 61%, 52%, 53%, 64%, 56%, §6%, 57%, 68%, or at least about 59%, 
5 In particular embodiments, the iroiypepttdss of the Invention i) have; or is) consist of 

an amino acid sequence with any of the degrees of identity as mentioned above. 

For the purposes of the present Invention, the degree of Identity between two amino 
acid sequences, as well as the degree of identity between two nucleotide sequences, is 
determined by the program "align" which is a Needleman-Wunsoh alignment (i.e. a global 

10 alignment). The program is used for alignment of polypeptide, as well as nucleotide 
sequences. The default scoring matrix BLQSUM§0 is used for polypeptide alignments, and; 
the default identity matrix is used for nucleotide alignments. The penalty for the first residue 
of a gap Is -12 for polypeptides and -16 for nucleotides. The penalties for further residues of 
a gap are -2 for polypeptides, and -4 for nucleotide. 

15 "Align" is part of the PASTA package version v20u8 {see W. R, Pearson and D. J. 

Llpman (1988), "improved Tools for Biological Sequence Analysis", PNAS 88:2444-2448, 
and W, Ft Pearson (1990) "Rapid and Sensitive Sequence Comparison with FASTP end 
FASTA," Methods in Erisyrrioiogy 183:83-98), FASTA protein alignments use the Smith- 
Waterman algorithm with no limitation on gap size (see "Smith-Waterman algorithm", T> F, 

20 Smith and IVK S. Watemian (1 981 ) J, Mai Biol. 147:195^197). 

In a particular embodiment the mature peptide parts, or predicted or expected 
:mtm peptide parts, of the two amine acid sequences are used for the alignment in the 
alternative, that part of the sequence, whose Identity to the mature peptide part of SEQ ID 
NO: 2 mdior SEQ ID NO: 8 is being examined, is chosen, which according to a multiple 

26 alignment made as described below Is most similar to the mature peptide part of SEQ ID NO: 
2 and/or 8, 1, a the corresponding amino acid residues as identified by the multiple alignment, 
in the present context, the basis for numbering amino add residues or assigning 
position numbers, of. the second aspect of the invention, is SEQ ID NO: 2 starting with Ai 
and ending with T188, in the alternative, the basis is amino adds 1-188 of the protease 

30 derived horn Nocardiopsis sp. NRRL 18262 (SEQ ID NO: 1 as disclosed In WO 01/58276, 
preferably SEQ ID NO: 1 as disclosed in WO 01/68278 In which A87 is substituted with T87. 
Proteases may comprise extensions as compared to these reference sequences, e.g. SEQ: 
ID NO: 2, I.e. in the N-ferminai, and/or the C-iermina! ends thereof. The amino adds of such 
extensions, If any, are to be numbered as is usual in the art, i.e. for a O-terminal extension: 

35 189, 190, 1 91 and so forth, and for an hl^ermrial extension -1 , -2, ~3 and so forth 

For each amino add residue In each protease aligned to the reference sequence, e.g. 
SEQ ID- NO: 2, as explained above (for the purposes of detormining degree of identity), it is 
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possible to directly and unambiguously assign an amino add residue in the reference 
sequence, e.g. SEQ ID NO: 2, to which It Dormspdnds. Corresponding residues are assigned 
trse same position, or number, by reference to, e.g., SEQ ID NO; 2. 

For each amino acid residue In another protease, the corresponding position of the 
5 reference sequence, e.g. SEQ ID NO: 2, can be found, as fallows': 

The amino acid sequence of the other protease Is designated SEQ X. A position 
corresponding to position N of SEQ ID HO; 2 Is found as follows: SEQ X is aligned with SEO: 
ID NO: 2 as specified above. From tbe alignment, the position in sequence SEQ X 
corresponding to position N of SEQ JD HQ: 2 can be dearly and unambiguously derived, 
10 using the principles described beiow. 

SEQ X may be a mature pari of the protease In question, or It may also Include a 
signai peptide part, or it may be a fragment of the mature protease which has protease 
activity, e.g. a fragment of the same length as SEQ ID NO: 2, and/or it may be the fragment 
which extends from A1 to T188 when aligned with SEQ ID NO: 2 as described herein. 
15 Three alignments are inserted below as Tables I. II and III. The alignments were 

.prepared as described above, aligning the mature part of another protease (SEQ XI, SEQ 
X2, and SEO X8, respectively) to SEQ ID NO: 2. Approximately SO amino acid residues of 
each protease are shown. 

Looking first at the alignment of Table I, it is dear that, e.g., G42 of SEQ ID NO: 2 
20 corresponds to P42 of SEQ X1, as these residues are on top of each other in the alignment. 
They are both assigned number 42, vfe. the number of the corresponding residue in SEO ID 
NO: 2. It is also apparent from this alignment that, e.g., SEQ X1 does not comprise 10Y, but 
it does comprise 38T. 

25 Table I 

ABlXftS&Mft MGGKCSVGFA f?.T^SkGQP«3r YTAGHGGTVC SEQ W NO: 2 

miimiAx'i: MccRcsvsm mmt&m* wj^h^vs tfvsiomqg beq m 

10 £0 30 40 50 

Tables il and III are examples of alignments producing gaps in either of the two 
sequences. 

30 in the alignment of Table li, a gap Is produced in SEQ X2. The highlighted amino acid 

residue P of SEQ X2 is, for the present purposes, designated P28, although in SEQ X as 
such itisP25. 

T able II 



1? 



mxx<m&x$ umiics\Kw& jmss&mww tw»j» • >m<irim&m bbq m m* % 

10 20 -}G 40 SO 

in the alignment of Table 111* a gap is produced in SEQ X3. When a gap is produced 
between amino acids having position number -nn and {nmi} of SEQ ID NO; 2, each position 
5 of the gap is assigned a iower case or subscript ietier: a, b, c etc. to the former position 
number, I.e. nn. Accordingly, each position of the gap is numbered nha, nnb etc, The 
highlighted amino acid residue R of SEQ X3 is, for the present purposes, designated R33a, 
although in SEQ X3 as such it is R34, 



10 TableJU 

miXGGh&Y-X UQGRCSVOm ATSS^pPOF W&---- SHOOT VSTSSVTXOK'OTO' SBQ ID NO: 2 

MyXXBQhPC.iT MGCKCSVGFA ATK'A S QQPOF Vr&SSOKCOT VOTPVSIOKOQG BEQ X3 
10 20 38 40 50 

in further particular embodiments of ether aspect of the invention, the polypeptide of 
the Invention has a melting temperature fr JB ) of at least 75*0, 76*0, 77*0, 78*0, 79*0, 80^0, 
S1°C, 82°C, 83 : 0, 84*0, 85*0, 06*0, 87*0, 88*0, 80*0, 9G*D, 91*0, 92*C, 93*0, 94*0, Of at 
1 5 least 95°C, as determined by Differential Scanning Caierlrneirv (DSC). 

The DSC is performed in a 10 nM sodium phosphate, 50 : m$ sodiym chloride buffer, 

pH 7.0. 

The scan rate is constant, e,g. 1,S*C/mSn. 
The interval scanned may he torn 20 to 100 *C. 
20 There are no upper iimilatione on fee T,*,, however, if is presently contemplated that 

the T m may be below 150*0, 14S°0, 140*0, 135*0, 130*0, 125*0, 120*0, WO, 110*0, 
105*0, or below 100*0 

In an alternative embodiment, another buffer is selected for the scanning, e.g. a 
buffer of pH 5,0, 5.5, 6,0, or pH 6.5. 
25 in further alternative embodiments, a higher or lower scan rate may be used, e.g. a 

lower one of 1,4 s C/mfo, 1,3*G/min, 1.2 t5 C/min, 1.1^0/rain, 10°C/min, or 0.9 c O/mln. 

Reference Is made to Example 2 for further details about the scanning procedure, 
in a particular embodiment, the protease of the invention exhibits an amended 
temperature activity profile as compared to the protease derived from Nocardiopste $p. 
So NRRL 18262. Per example, the protease of the invention may exhibit a relative activity at pH 
0 and 80*0 of at least 0.40, preferably at least 0.45, 0.50, 0.55, 0.60, 0.66, 0.70, 0.75, 0.80, 
0.85, 0.00, or at least 0.05, fie term M feiative" referring to the maximum activity measured for 
the protease in question. For Protease 22, tie L2a protease, as well as for the protease 
derived from NocaMiopsss 0$$wx%tei' siMspBclm <temmiv®ei DSfvi 43235, the activity is 
35 relative to the activity at 80*O which Is set to 1.000 {100%), and for Protease 8, and the 
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protease derived from Nocardiopm sp, NRRL 18282, the actSytfy at 70°C f§ set to 1000 
(100%), see Examples 2, 7, 8, and 9, M another example, the protease of the Invention 
exhibits a relative activity at pH 9 and 90*0 of ai feast 0.10, preferably at least 0,15, 0.20, 
0,25, 0.30, or of at least 0,35, In a particular embodiment, the protease activity is measured 
§ using the Prtfeyme AK assay of ©cample 2. 

A screening for thermostable proteases miatad to SEQ ID NGs: 2, 8, 1 0 or 12 of the 
Invention may be performed as follows: Screening a DMA library with primers, e.g, either of 
SEQ ID NOs; 3 or 4 S or preferably with the mature peptide encoding parts of either of SEQ ID 
NOs: 1, 7, 9 or 11; expressing the hybridising clones in a suitable strain, e.g. a strain of 

10 B&cflius or E co//. In a next step, the expressed proteases related to either of SEQ ID NOs; 
2, 8, 10 and/or 12 are purified, preferably In a micro-purification process (sea e.g. WO 
03/037914), and the amount of active protease Is determined for each candidate by use of 
the well-known principle of active site titration (AST) with a strong inhibitor of the enzyme, to 
a subsequent step, a known amount of the enzyme is then Incubated at a desired elevated 

is temperature, e.g. 8S Q C, for a desired time, e.g. 2 hours; and the residual protease activity is 
deienrrsned, e.g, by use of any of the protease assays described herein, tor example the 
Proiaryme AK assay of Example 2, The major part of the procedure may be ■automated, 
and if desired performed with the assistance of robots. The verification of thermostability Is 
e.g, done by purification of the protease, and establishment of T m hy BSC as described In the 

20 experimental part herein, 

The present Invention also relates to the animal feed use of the polypeptides of the 
invention. 

The degree of identity between two amino acid sequences may also be determined 
by the Ciustal method (Higgiris, 1989, CA8IOS 5; 151-153) using the LASERGEIME^ 

2S MEG ALIGNS software (DNASTAR, tnc, Madison. Wl) with an Identity table and the 
following multiple alignment parameters; Gap penalty of 10, and gap length penalty of 10, 
Pairwise alignment parameters are Ktupie«1, gap penaiiy--3, wlndows~S : and dlagonais~8. 
The degree of identity between two nucleotide sequences may foe determined using the 
same algorithm and software package as described above with the following settings; Gap 

30 penalty of 10, and gap length penalty of 10, Pairwise alignment parameters are Ktupia~3, 
gap penaliywS and wIndows-20, 

In a particular embodiment, tire homologous polypeptides have an amino acid 
sequence that differs from amino acids 1 to 18$ of SEQ ID NO: 2 or -166 to 188 of SEQ ID 
NO; 2, or from amino acids 1 to 196 ©f SEQ ID NO; 8 or -192 to 193 of SEQ ID HO; 8, by 

35 (I) no more than 75, 74, 73, 72, 71, 70, 8S, 68, 87 f 66, 85, 64, 63, 62, 82, 60, 59, 58, 

57, 58, 50, 54, S3, 52, 51, 50, 49, 48, 47, 48, 48, 44, 43, 42, 41, 40, 39, 38, 37, 38, 35, 34, 
33, 32, 31, 30, 20, 28, 27. 26, 25, 24, 23, 22, 21 , or no more then 20 amino acids; 



(ft) no mors than -twenty, nineteen, eighteen, seventeen, sixteen, fifteen, fourteen, 
thirteen, twelve, or no more than eleven amino acids; 

(iii) no more than ten, nine, eight, seven, six, two, four, three, two, or no more than 
one amino add-; 

S (fv) fen, or by nine, or by eight, or by seven, or by six, or by five amino adds; or 

(y) four, or by three, or by two amino adds, or by one amino acid, 
in a particular embodiment, the polypeptides of the present invention comprise the 
amino acid sequence of the mature peptide parts of either of SEQ ID NOs: 2, 8, 10, or 12, or 
allelic variants thereof; or fragments thereof that have protease activity, 
10 in another preferred embodiment, the polypeptides of the present invention consist of 

the mature peptide parts of either of SEQ ID NQs: 2, 8, 10, or 12, or allelic variants thereof; 
or fragments thereof thai have protease activity. 

A fragment of the mature peptide parts of either of SEQ ID NOs: 2, 8, 10, or 12 is a 
polypeptide having one or more amino acids deleted from the amino and/or earboxyl 
1 5 terminus of these amino acid sequences. In one embodiment a fragment contains at least- 75 
amino acid residues, or at least 100 amino acid residues, or at least 125 amino acid 
residues, or at least 150 amino acid residues, or at least 160 amino acid residues, or at least 
16S amino acid residues,; or at least 170 amino acid residues, or at least 175 amino acid 
residues, 

20 An allelic variant denotes any of two or more alternative farms of a gene occupying 

the same chromosomal locus. Allelic variation arises naturally through mutation, and may 
result in polymorphism within populations. Gene mutations can be silent (no chahg© In the 
encoded polypeptide) or may encode polypeptides having altered amino acid sequences. An 
allelic variant of a polypeptide is a polypeptide encoded by an allelic variant of a gene, 

2S The present invention also relates to Isolated polypeptides having protease activity and 
which are encoded by nucleic acid sequences which hybridise under very low, or low, or 
medium, or medium-high, or high, w very high stringency conditions with a nucleic acid 
probe which hybridizes under the same conditions with (a) nucleotides 499-1062 of SEQ SD 
NO; 1, nucleotides 577-1164 of SEQ ID NO; 7, nucleotides 586-1152 of SEQ ID NO: 9, 

30 and/or nucleotides 502-1085 of SEQ ID NO: 11; (b) a subsequence of (a), or (c) a 
complementary strand of (a), or .(b) (J, Sambrook, E.F. Frifsch, and T, fenlatis, 1989, 
Molecular Cloning, A Laboratory Manual, 2nd edition, Cold Spring Harbor, Mew York), in one 
particular embodiment the nucleic acid probe Is selected from amongst the nucleic acid 
sequences of (a), (b)., or (c) above. The nucleotides of (a) correspond to the mature peptide 

35 encoding parts. 



20 



wo 1 1 tin H:rm%2imtimm2 
The subsequence of the nucleotides mentioned under (a) above may he at least 100 

nucleotides, or m another embodiment at least 200 nucleotides. Moreover, the subsequence 

may encode a polypeptide fragment that has protease activity. 

The nucleic acid sequences of (a) or (b) above, as well as the corresponding mature 

§ amino acid sequences of SEQ ID NOs: 2, 8, 10, or 12, or' a fragment thereof, may be used to 

design a nucleic acid probe, to Identify: and clone DNA encoding polypeptides having 

protease activity from strains of different genera or species -according to methods well known 

In the art. In particular, such probes can be used for hybridization with the genomic or cDNA 

of the genus or species of interest, following standard Southern blotting procedures, in order 

10 to identify and Isolate the corresponding gene therein. Such probes can be considerably 
shorter than the entire sequence, but should be at least 15, preferably at least 25, and more 
preferably at least 35 nucleotides in length. Longer probes can also be used. Both DIM A and 
RNA probes can he used. The probes are typically labeled for defecting the corresponding 
gene (for example/with ^P, 3 H S S5 S, biofln, or avldio), Such probes are encompassed by tie 

15 present invention.- 

Thus, a genomic DNA or cDNA library prepared from such other organisms may be 
screened for DNA that hybridizes with the probes described above and which encodes a 
polypeptide having protease activity. Genomic or other DNA from such other organisms may 
be separated by agarose or poiyaoryiamide gel electrophoresis, or other separation 

M techniques. DNA from the libraries or the separated DNA may be transferred to and 
immobilized on nitrocellulose or other suitable center material. In order to identify a clone or 
DNA which is homologous with either of SEQ ID NOs; 1. ?, 9, or 11, or a subsequence 
thereof, the carper material is used in a Southern blot. For purposes of the present Invention, 
hybridization indicates that the nucleic acid sequence hybridises to a labeled nucleic acid 

25 probe corresponding to the sequences of (a), (b) or (c) above, under very low to very high 
stringency conditions, Molecules to which the nucleic acid probe hybridises under these 
conditions are detected using X~ray film. 

In a particular embodiment, the nucleic acid probe is a nucleic acid sequence which 
encodes amino acids 1 to 188 of SEQ ID HO: 2, 1 to 198 of SEQ ID NO; S, 1 to 189 of SEQ 

30 ID NO; 10, and/or 1 to 1SS of SEQ ID NO: 12, or subsequences thereof, in another 
embodiment the nucleic acid probe is nudeotides 400-1062 of SEQ 10 NO: 1, §77-1154 of 
SEQ ID NO: 7, 886-1152 of SEQ SO NO; 9, and/or 502-1965 of SEQ ID NO; 11 (the mature 
polypeptide coding regions). 

For long probes of at least 100 nucleotides in length, very low to very high stringency 

35 conditions are defined as prehyhridteatlon and hybridization at 42X In SX SSPE, 0.3% SDS, 
200 pg/mi sheared and denatured salmon sperm DNA, and either 25% formamfde for very 
low and low stringencies, 35% formamlde for medium and medium-high stringencies, or 50% 



formamlde for high and wry high stringencies, following standard Southern blotting 
procedures. 

For long probes of at least 100 nucleotides In jahgtfh the carrier materia! Is finally 
washed three times each for 18 minutes using 0,2: x SSC, 0.2% 8DS, 20% formamlde 
s preferably at least at 48*0 (very low stringency}, more preferably at least at 50°C (low 
stringency), more preferably at least at $$% (medium stringency), more preferably at least at 
60*C (medium-high stringency}, even more prefembly at feast at 05°C (high stringency}, and 
most preferably at least at ?G°C (very high stringency}. 

For short probes about 15 nucleotides to about 70 nucleotides in length, stringency 

10 conditions are defined as prehybridlzaiion, hybridization, and washing post-hybridisation at 
§°C to 1Q°C below the calculated T«, using the calculation according to Bolton and McCarthy 
(1962, Proceedings of the National .Academy of Sciences USA .48:1390) in 0,9 M Nad, 0.09 
M Tris~HCi pH 7,6, 8 mU BUT A, 0,S% HP-4Q, IX Denhardfs solution, 1 mM sodium 
pyrophosphate, 1 mM sodium monobasic phosphate, 0.1 niM ATP, and 0.2 mg -of yeast RNA 

1 S per m! following standard Southern blotting procedures. 

For short probes about 16 nucleotides to about 70 nucleotides in length, the carrier 
material is washed oriee in 8X SSC plus 0,1% SDS for IS minutes and twice each for 15 
minutes using 8X SSC at S*C to 10*C below the calculated T w . 

The present Invention also relates to variants of the polypeptide having an amino acid 

20 sequence of amino acids 1 to 188 of SEQ I'D NO: 2, 1 to 196 of SEQ ID NO: 8, 1 to 189 of 
SEO ID NO; 10, and/of 1 to 188 of SEQ -ID NO: 12, comprising a substitution, deletion, 
and/or insertion of one or mors amino acids. 

The amino add sequences of the variant polypeptides may differ from the amino acid 
sequence of amino acids 1 to 188 of SEQ ID NO: 2, 1 to 196 of SEQ ID NO: 8, 1 to 189 of 

25 SEO ID NO: 10, and/or 1 to 186 of SEQ ID NO: \% by an insertion or deletion of one or more 
amino acid residues and/or the substitution of one or more amino acid residues by different 
amino acid residues, in a particular embodiment, amino add changes are of a minor nature, 
that is conservative amino add substitutions that do not significantly affect the folding and/or 
activity of the protein; small deletions, typically of one to about 80 amino acids; small amino- 

30 or carboxyMerminai extensions, such as an amino-terminal methionine residue; a small 
peptide of up to about 20-28 residues; or a small extension that facilitates purification by 
changing- net charge or another function, such as a pcly-histidine tract, an antigenic epitope 
or a binding domain. 

Examples of conservative substitutions are within the group of basic amino acids 
35 (arginine, lysine and histidine), addle amino acids (glutamic acid and aspartie acid), polar 
amino acids (glutamine and asparagirie), hydrophobic amino adds (leucine, isoleueine and 
valine}, aromatic amino acids (phenylalanine, tryptophan and tyrosine), and small amino 
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adds (glycine, alanine, serine, threonine and methionine). Accordingly, for example, the 
invention relates to a polypeptide having* or comprising, a sequence as set forth in the 
mature peptide parts of either of §BQ ID NOs: 2, 0, TO, or 12, wherein conservative amino 
add substitutions comprise replacements, one for another, among the basic amino acids 
8 (arginine, lysine and hlstidine), among the aeicife amino acids, (glutamic acid and aspartic 
add), among the polar amino acids (glutemine and asparagirte), among the hydrophobic 
amino acids (alanine, leucine, isaieuelne, and valine), among the aromatic amino acids 
(phenylalanine,, tryptophan and tyrosine), and among the smaii amino acids {glycine, alanine, 
serine, threonine and methionine), or any combination thereof or active fragments thereof, 

io Amino add substitutions which do not generally alter the specific activity are known in the art 
and are described, for example, by H, Neurafh and RL Hill, 1978, in, The Proteins, 
Academic Press, New York, The most commonly occurring exchanges are Ala/Ser, Val/lie, 
Asp/Giu, Thr/Ser, Ala/Gly Aia/Thr, SertAsn, Ala/Val, Ser/Giy, Tyr/Phe, Ala/Pro, Lys/Arg, 
Asp/Asn, leu/He, LeuA/ai, Aja/Giu, and Asp/Gly as well as these in reverse, 

is in a particular embodiment, the polypeptides of the Invention and for use according to 

the invention are acld-sfabie. For the present purposes, the term acid-stable means that the 
residual activity after 2 hours of incubation at pH 2.0., pH 2,5 or pH 3.0 and 37°C, Is at (east 
50%, as compared to the residual activity of a corresponding sample incubated M 2 hours at 
pH 9,0 and S*C. in a particular embodiment, the residual adfvtty is at feast W%> 70%, 80% 

20 or at least 80%. A suitable assay for determining acid-stability is the pH-stability assay of 
Example 2, 

in another particular embodiment the polypeptides of the invention and for use 
according to the invention have a relative activity at pH 7,0 of at least 0,10, 0,15, 0.20. 0.25, 
0.30 or at least 0,85. The phi-profile test of Example 2 Is used for the determination. 

25 in still further particular embodiments, the polypeptides of the Invention and for use 

according to the invention haw I) a relative activity at 60"C and pH 9 of at least 0,05, 0,10, 
0,1 S or at least 0,19; and/or is) a relative activity at ?0*C of at least 0,40, 0.50, or at least 
0.60. The temperature-profile test of Example 2 is used for these determinations. 

The polypeptide of the invention and for use according to the invention may be a 

30 bacterial or fungal polypeptide. The fungal polypeptide may he derived from a filamentous 
fungus or from a yeast. 

In particular embodiments, the polypeptide of the invention is I) a bacteria! protease; 
ii) a protease of the phylum AMnobaaterim ill) of the class Acimobactem; Iv) of the order 
AciinomycetalQS v) of the family Mo^afd/opaaceae; vl) of the genus Mocarofeps/s; and/or a 

35 protease derived from vit> NooBrtiiopsis species such as 'Naoantjopsis a/da, Noc&rdiopsls 
alksfiphM, NocBrdiopsls ant&mtim s Noc8rdiopsl& pms/na, NocsrdlQpsIs composta, 
Noc&rdmpBSS exhafens, Noc&rdiopsm haiophiia, Nocardi&pB® haiatolarans, Nocardhpsis 



kmsammm, Nocardiopsis (Mart, Nmardtaps® jmaaiam®, Nocardiopsis metafhms, 
NocBrdiopsis ^n^mMaimm^ ■N^r^s^s-p^^,- Nd&af^ip^-lf^ka, Nocmrdiopsb 
wmdmcholae, Nocardiopsis xmjmngensis, or Nmardiopsis dassomiiM, for example 
Nocardiopsis dassonviltef subsp, <fassmv®ei,DSM 43235, Nmedk^sis sp, DSM 18424, or 
S Nomrdiopsis alba DSM 1 §647: such as a. polypeptide with : eitfcer of the mature amino acid 
sequences of SEQ ID NOs: 2, 8, 10, and- 12, Hi a particular embodiment the protease 
derives from Nocardiopsis alba, Noowdiopmmtafptica, or Wmrdiopsis dassonvitiei 

The above taxonomy is according to fie chapter: The road map to the Manual by 
G.M. Sanity & ,j. g, Hoff in Sergey's Manual of Systematic Bacteriology, 2001, second 

10 edition, volume 1 , David R Bone, Richard W. Castenhoiz. 

It wf be understood that for the aforementioned species, the Invention encompasses 
both the perfect and imperfect stales, and other taxonomic equivalents, e.g., anamorphs, 
regardless of the species name by which they are known, Those skilled in the art Will readily 
recognize the identity of appropriate equivalents.- 

15 Strains of these species are readily accessible to the public in a number of culture 

collections, such as the American Type Culture Collection (ATCC), Deutsche Sammf«n§ von 
Mikroorganismen und Zelikulturen GmbH {DSM}, Centraaiburoau Vddr Schimmelcukures 
(CBS), and Agricultural Research Service Patent Culture Collection, Northern Regional 
Research Center (NRRL), 

20 Furthermore, such polypeptides may be Identified and obtained from other sources 

including microorganisms Isolated from nature (e.g., sell, composts, water, etc) using the 
above-mentioned probes. Techniques for isolating microorganisms from natural habitats are 
Well Known is the art. The nucleic acid sequence may then be derived by similarly screening 
a genomic or cDNA library of another microorganism. Once a nucleic add sequence 

25 encoding a polypeptide has been detected with the probe(s), the sequence may be isolated 
or cloned by utilizing techniques which are /known to those of ordinary skill In the art (see, 
e,g. s Sambrook et al M 1080, supra). 

As defined herein, an "Isolated" polypeptide is a polypeptide which is essentially free 
of other polypeptides, e.g., at least about 20% pure, preferably at least about 40% pure, 

30 more preferably about 80% pure, even more preferably about 80% pure, most preferably 
about 90% pure, and even most preferably about 95% pure, as determined by SDS-PAGE, 

Polypeptides encoded by nucleic acid sequences of the present invention also 
include fused polypeptides or deavable fusion polypeptides in which another polypeptide is 
fused at the M-terminus or the C4atmiuus af file polypeptide or fragment thereof, A fused 

3S polypeptide is produced by fusing a nucleic acid sequence (or a portion thereof) encoding 
another polypeptide to a nucleic acid sequence (or a portion thereof) of the present invention. 
Techniques for producing fusion polypeptides are known in the art, e.g. PCR, or ligating the 



coding sequences encoding the polypeptides sc that they are hi frame and that expression of 
the fused polypeptide is under control of the same promoter£s) and termi nator. 

In sill further particular embodiments, the invention excludes one or more of the 
proteases derived from (f) NomrdhpsiB dmsomiHBl NRRL 18133 which Is disclosed in WO 
6 88/03947; (Is) Noamitepsis sp. strain QPG-210 (FE^ P-10508) which is disclosed in JP 2~ 
255081 -A ; p) strain ZIMET 4384? -of speeiss .f4btim$w$te d&ssonvilf&s which is 
disclosed in DO 200432|8; (iv> Nmardiopsm&p, TOM (FERIVI4M 8878), which is disclosed 
In JP 2000284571; and/Or (v) the corresponding PNA> 

Nucleic Acid Sequences 

10 The present invention also relates to isdafed nucleic acid sequences thai encode a 
polypeptide of the present invention. Particular nucleic acid sequences of the Invention are 
nucleotides 499-1082 of SEQ ID NO: 1, §77-1184 of SEQ ID NO: 7, 588-1152 of SEG ID 
NO: 9, and/or 502-1065 of SEQ ID NO: 11; corresponding to the mature polypeptide 
encoding regions of SEQ ID NOs; 2, 8, 10, and 12, respectively. The present invention also 

ig encompasses nucleic acid sequences which encode a polypeptide having the amino acid 
sequence of the mature peptide parts of either of SEQ ID NOs; 2, 8, 18, and 12, but which 
differ from tie corresponding parts of SEQ ID NOs; 1, 7, 9, and 11, respectively, by virtue of 
the degeneracy of the genetic cede. The present invention also relates to subsequences of 
SEG ID NOs: 1, 7, 9, and 11, which encode fragments of SEQ ID NOs; 2, 8, TO, and 12, 

20 .respectively, which have protease activity. 

A subsequence of SEQ ID NOs: 1, 7, 9, and 11, is a nucleic acid sequence 
encompassed by SEQ ID NOs; 1, 7, % and 11, respectively, except that one or more 
nucleotides from the S J and/or 3' end has been deleted. Preferably, a subsequence contains 
at least 228 nucleotides, more preferably at least 300 nucleotides, even more preferably at 

2S least 375, 450, 500, 531 , 800, 700, 800, 900 or 1000 nucleotides. 

The present invention also relates to nucleotide sequences which have a degree of 
Identity to nucleotides 409-1082 of SEG ID NO: 1, 577-1184 of SEQ ID NO: 7, 586-1152 of 
SEQ ID NO: 9, and/or 502-1085 of SEQ ID NO: 11 of af (east 80%. in particular 
embodiments, the degree of identity to either of these nucleotides is at least 62%, 64%, 86%, 

30 88%, 70%, 72%, 74%, 78%, 78%, 80%, 81%, 82%, 83%, 84%, 35%, 88%, 87%, 88%, 89%, 
90%, 91%, 92%, 93%, 94%, 95%, 98%, 97%, 98%, or at least 99%. 

The present invention also relates to mutant nucleic acid sequences comprising at 
least one mutation In fhs mature peptide encoding parts of either of SEQ ID NOs: 1, 7, 9, or 
11, In which the mutant nucleic acid sequence encodes a polypeptide which (!) consists of 

35 the mature peptide parte of either of SEG ID NQs: .2, 8, 10, and 12, 'respectively, or {si} is a 
variant of any of the sequences of d}, : Wherein the variant comprises a substitution, deletion, 
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and/or insertion of one or more amino aelits, or :(IH) is an allelic variant of any of the 
sequences of (I), or (iv) is a fragment -of any of ma sequences of (0, 

The techniques used to isolate or clone a nucleic acid sequence encoding a 
polypeptide are known In the art and ineluda isolation from genomic DNA, preparation from 
5 cDNA, or a combination thereof. The cloning of the naslolo add sequences of the present 
invention from such genomic ONA can bo effectad, e#, by using the well known polymerase 
chain reaction (FCR) or antibody screening of expression libraries to detect cloned DMA 
fragments with shared structural features, See, e.g., innis et at, 1990, PGR: A Guide to 
Methods and Application, Academic Press, Mew York, Other nucleic add amplification 

10 procedures such as ligase chain reaction (ICR), ligated activated transcription (LAT) and 
nucleic acid sequence-based amplification- (MASBA) may be used, The nucleic acid 
sequence may be cloned from a strain of Noeardiapsis or another or related organism and 
thus, for example, may be an allelic or species variant of the polypeptide encoding region of 
the nucleic add sequence, 

18 The term 'Isolated nucleic acid sequence" as used herein refers to a nucleic acid 

sequence which is essentially free of other nucleic acid sequences, e.g., at least about 20% 
pure, preferably at least about 40% pure, more preferably at least about 80% pure, even 
more preferably at feast about 00% pure, and most preferably at least about 90% pure as 
determined by agarose electrophoresis. For example, an isolated nucleic add sequence dan 

20 be Obtained by standard cloning procedures used in genetic engineering: to relocate the 
nucleic acid sequence from its natural location to a different site where it will be reproduced. 
The cloning procedures may Involve excision and isolation of a desired nucleic acid fragment 
comprising the nucleic acid sequence encoding the polypeptide, insertion of the fragment 
Into a vector molecule, and incorporation of the recombinant vector into a host cell where 

25 multiple copies or clones of the nucleic acid sequence will be replicated. The nucleic acid 
sequence may be of genomic, cDNA, RNA, semisynthetic, synthetic origin, or any 
combinations thereof, 

Modification of a nucleic acid sequence encoding a polypeptide of the present 
Invention may be necessary for the. synthesis of polypeptides substantially similar to the 

30 polypeptide. The term "substantially similar* to the polypeptide refers to non-naiuraliy 
occurring forms of the polypeptide- These polypeptides may differ in some engineered way 
from the polypeptide isolated from its native source, e.g., variants that differ in specific 
activity, thermostability, pH optimum, alfergenidty, or the like. The variant sequence may be 
constructed on the basis of the nudeic acid sequence presented as the polypeptide encoding 

35 part of SEQ ID NOs: 1, 7,9 and/or 11* e.g.* a subsequence thereof, and/or by introduction of 
nucleotide substitutions which dd hot give rise to another amino acid sequence of the 
polypeptide encoded by the nucleic acid sequence, but which correspond to the cation usage 
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of the host organism intended for production of the protease, or by introduction of nucleotide 
substitutions which may give rise to a dtfferenl amino acid sequence. For a general 
description of nucleotide substitution, see, e.g,, Ford ©t al„ 1991, Protein Expression and 
Purification 2; 95-107. Low-aliergenfe polypeptides can a,g. be prepared as described above. 

It will be apparent to these skied In the art that such substitutions can be made 
outside the regions critical to the fendion of the molecule and stlf result in an active 
polypeptide, Amino add residues essentiatto the activity of the polypeptide encoded by the 
isolated nucleic add sequence of the invention, and therefore preferably not subject to 
substitution, may be identified according to procedures known in the art, such as site- 
directed mutagenesis or alanine-scanning mutagenesis (see, e.g., Cunningham and Wells, 
1989, Science 244: 1081-1085). In the latter technique, mutations are introduced at every 
positively charged residue in the molecule, and the resultant mutant molecules are tested for 
protease activity to identify amino acid residues that are critlbal to the activity of the molecule. 
Sites of substrate-protease interaction can also he determined by analysis of the three- 
dimensional structure as determined by such techniques as oudear magnetic resonance 
analysis, crystallography or photoaffinlty labelling (see, eg., de Vos et af., 1992, Science 
255: 306-312; Smith st aL. 1992, Journal of Molecular Bioiogy 224: 899-904; VVIodaver et ai„ 
1992, FEBS Letters 309: 59-84). 

The present invention also relates to isolated nucleic acid sequences encoding a 
polypeptide of the present invention, which hybridise under very sow stringency conditions * 
preferably low stringency conditions, more preferably medium stringency conditions, more 
preferably medium-high stringency conditions, even more preferably high stringency 
conditions, and most preferably very high stringency conditions with a nucleic add probe 
which hybridizes under the same conditions with tie nucleic add sequence, preferably the 
mature peptide encoding part, of either of SEQ ID NOs: 1, 7. 9, or 1 1 , or their complementary 
strands; or allelic variants and subsequences thereof (Sambrook et al„ 1989, supra), as 
defined herein. 

The present invention also relates te« isolated nucleic acid sequences produced by (a) 
hybridizing a DMA under very low, low, medium, medium-high, high, or very high stringency 
conditions with (!) nucleotides 499-1062 of SEQ ID HO: 1, 577-11 84 of SEQ ID NO; 7, 588- 
1152 of SEQ ID NO; 9, and/or 502-1085 of SEQ ID HQ: 11, a subsequence of (i% or (ill) a 
complementary strand of % or 0i); and (b) isolating the nucleic acid sequence. The 
subsequence is preferably a sequence of at least 100 nucleotides such as a sequence that 
encodes a polypeptide fragment which has protease adiviy. 

Methods for Producing !V1 slant Mucleic Acid Sequences 
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The present Invention further relates to metfio^s for producing a mutant nucleic add 
sequence, comprising introducing at least one mutation solo tiia mature polypeptide coding 
sequence of either of SEQ ID NOs: i, 7, 9. or If, or a subsequence thereof, wherein the 
mutant nucleic add sequence encodes a polypeptide which consists of the mature peptides 
5 of either of SEQ ID NOs: 2, 8, 10, or 12, respeetiyeiy; -or a fragment thereof which has 
protease activity. 

The introduction of a mutation into the nudaie .add sequence to exchange one 
nucleotide for another nucleotide may he accomplished by site-directed mutagenesis using 
any of the methods known in the art Partieulady Useful is the procedure that utilizes a. 

is supercolied, double stranded DMA vector vffli an insert of Interest and two synthetic primers 
containing the desired mutation. The oligonucleotide primers, each complementary to 
opposite strands of the vector, extend during temperature cycling by means of Pfu DNA 
polymerase. On incorporation of the primers, a mutated plasmid containing staggered nicks 
is generated. Following temperature cyciing, the product is treated with Dpnl which is specific 

is for methylated and hemlmethylatad DNA to digest the parental DNA template and lo selecl 
for mutation-containing synthesized DNA.. Other procedures known in the art may also be 
used. 

Nucleic Acid Constructs 

.20 The present Invention also relates to nucleic add constructs comprising a nucleic add 

sequence of the present Invention operahly linked to one or more control sequences that 
direct the expression of the coding sequence in a suitable host cell under conditions 
compatible with the control sequences. Expression will he understood to include any step 
involved in the production of the polypeptide including, but not limited to, transcription, post- 

26 trahscriptionalmoditication, translation, post-translationa! modification, and secretion. 

"Nucleic acid construct" is defined herein as a nucleic acid molecule, either single- or 
double-stranded, which Is isolated from a naturally occurring gene or which has been 
modified to contain segments of huciefe add combined and Juxtaposed in a manner that 
would not otherwise exist In nature. The term nucleic acid construct is synonymous with the 

30 term expression cassette when the nucleic acid construct contains all the control sequences 
required for expression of a coding sequence of the present invention. The term "coding 
sequence* is defined herein as a nucleic acid sequence that directly specifies the amino add 
sequence of Its protein product. The boundaries of the coding sequence are generally 
determined by a dbosome binding site (prekaryot.es) or by the ATG start codon (eukaryotes) 

35 located just upstream of the open reading frame at the 5 ! end of the mRNA and a 
transcription terminator sequence located just downstream of the open reading frame at the 
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3* end of the mRNA. A coding sequence can include, but is not limited to, DMA, cDNA, and 
recombinant nucleic acid sequences..: 

An Isolated nucleic acsd sequeifioe encodiUg a poiypepiido of the present invention 
may bo manipulated in a variety of ways to provide for expression of the polypeptide. 
S Manipulation of the nucleic acid sequence prior to Its insertion into a vector may be desirable 
or necessary depending on the expression vector. The techniques for modifying nucleic acid 
sequences utilizing recombinant DHA methods are well known In the art 

Tiie term "control sequences* 'b. defined herein to include all components that are 
.necessary or advantageous for the expression of a polypeptide of the present invention. 

to Each control sequence may be native or foreign to the nucleic add sequence encoding the 
polypeptide. Such control sequences Include, but are not limited to, a leader, poiyadenylatlon 
sequence, propeptide sequence, promoter, signal peptide sequence, and transcription 
terminator. At a minimum, the control sequences include a promoter, and transcriptional and 
translator! stop signals. The control sequences may be provided with linkers for the 

15 purpose of Introducing specific restriction sites facilitating ligation of the control sequences 
with the coding region of the nucleic acid sequence encoding a polypeptide. The term 
"oparahly linked" is defined herein as a configuration in which a control sequence is 
appropriately placed at a position relative to the coding sequence of the DMA sequence such 
that the control sequence directs the expression of a polypeptide. 

20 The control sequence may be an appropriate promoter sequence, a nucleic acid 

sequence that is recognised by a host cell for expression of the nucleic acid sequence. The 
promoter sequence contains transcriptional control sequences that mediate the expression of 
the polypeptide, The promoter may be any nucleic acid sequence which shews 
transcriptional activity in the host ceil of choice including mutant, truncated, and hybrid 

W promoters, and may be obtained from genes encoding extracellular or intracellular 
polypeptides either homologous or heterologous to the host ceil. 

Examples of suitable promoters tor directing the transcription of the nucleic acid 
constructs of the present invention, especially In a bacteria! host cell, are the promoters 
obtained from the £ col/ lac operon, Stmpiomyc&s coe/?oo/or agarase gene (dagA), BaotHm 

30 subtle ievansucrase gene (sacB), BacHiim Mhanifonms aipha-amyiase gene (amyl), 
Sac/I/us siearoth&rmophBus maitogenic amylase gene (amyM), Sac/fos amyfo//quefaefer?s 
aipha-amyiase gene (amyQ), Ba&fflus MchenMonms penicillinase gene (penP} ; Backus 
subfflis xytA and xylS genes, and prokaryetie beta-iaeiamase gene {Via-Karoaroff et at, 
1S78, Proceedings of the Mstionai Academy of Sciences USA 7% 3727-3731), as well as the 

35 tac promoter (DeSoer at ai, ; 1S83, Proceedings of the National Academy of Sciences USA 
80: 21-25). Further promoters are described in "Useful proteins from recombinant bacteria" in 
Scientific Amencan. 1980, 242: 74-94; and in Samhrookefat., 1989^ supra. 

oo 



Examples of suitable promoters for directing the transcription of the nucleic acid 
constructs of the present Invention in a filamentous fungal host eel! are promoters obtained 
from the genes for Aspergillus wyzm TAKA amylase., Rhizomumr m7e.be/ asparile 
proteinase, Aspergillus mgw neutfa! alpha-amyiasa, Aspergillus nigw acid stable alpha- 
5 amylase, Asp&rgfflm aig&r or AspergMuMmmrnQfi gluceamyfase (glaA), Rhlzomucor miehel 
lipase, Aspergillus oryzae alkaline protease, AspargMm oryzBs those phosphate Isomerase, 
Aspergillus niduiaos aeetarofdas®, and Fusarium oxyspomn trypsin-like protease (WO 
98/00787), as well as the NA2-tpl promoter (a hybnd of the promoters from the genes for 
Asp&rgSias nig&r neutrai aipha-amylase ■^A£p®^tf§--0ryzm irk*&& phosphate Isomerase), 

10 and mutant, truncated, and hybrid promoters thereof. 

In a yeast host, useful promoters are obtained from the genes for Smchatvmyms 
cetdmim enelase (ENO-i), Ssccharomyc&s c&revisiae gaiaetoklnase (GAL1), 
SdGGharomyces cemvisiao alcohol dehydrogenase/glyosraldehyde-3-phosphsaf& dehydro- 
genase (ADH2/GAP), and Ssocharomyces cemvisiae S-phosphogiycarate kinase. Other 

15 useful promoters for yeast host cells are described by Romanes et af„ 1892, Yeast 8: 423- 
488. 

The control sequence may also be a suitable transcription terminator sequence, a 
sequence recognized by a host ceil to terminate transcription. The terminator sequence is 
operably linked to the 3* terminus of the nucleic acid sequence encoding the polypeptide. 
20 Any terminator which is functional in the host ceil of choice may be used In the present 
invention, 

Preferred terminators for filamentous fungal host cells are obtained from the genes 
for Aspmgiilus oryzae TAKA amylase, Aspergillus nigar giucoarnyiass, Aspergillus mduians 
anthranilaie synthase, Aspergillus nigm'&lphB-glmQsms^ and FmmimoxtfspQrum trypsin- 
25 like protease. 

Preferred terminators for yeast host cells are obtained from the genes for 
Saccharemyoes mmyislm enoiase, SacoMromyces csmvisim cytochrome C (CYC1), and 
Saccframnnycas cerevtsiae giyseraldehyde^hosphate dehydrogenase. Other useful 
terminators for yeast host ceils are described by Romanos et at, 1 982, supra. 

30 Preferred terminators for bacteria! host eeiis, such as a Bacillus host cell, are the 

terminators from BaciSm iletemfyrmis afpha-amylase gene (amyl), the MmiHus 
sieamtlmmophiius matogenic amylase gene; fiaroyM),-. or the Bacillus amyfoiiqmfmiem 
alpha-amyiase gene (amyQ), 

The control sequence may aiso be a suitable leader sequence, a nontranslated 

35 region of an mRNA which is important for translation by fee host cell The leader sequence Is 
operabiy linked to the §' terminus of the nucleic acid sequence encoding the polypeptide. 
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Any leader sequence that is functional in the host eel! of choice may be used In the present 
invention. 

Preferred leaders for filamentous fungal host cells are obtained from the genes for 
Aspergillus otyme TAKA amylase and Asp&rgllim nkliMhs those phosphate isometase.. 
5 Suitable leaders for yeast host cells are obtained from the genes for Sascttaromyc&s 

c&r&vime enolase (ENOf} s S&ccharomyms cetwlsim 3-phosphoglyeerate kinase, 
SBCGitawmyms mmtisfaa alpha-factor, and Smcharomyces cemvlsiae alcohol 
dahydrogenase/^lyoe^aldehyde-3-phQs^a^ g#jydrog$nas'e (ADH2/GAP). 

The control sequence may also be a poiyadenySaiion sequence, a sequence operabiy 
10 linked to the 3' terminus of the nucleic acid sequence and which, when transcribed; Is 
recognized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. 
Any poiyadenyiaiton sequence which is functional In the host cell of choice may be used in 
the present invention. 

Preferred pofyadenylation sequences for filamentous fungal host cells are obtained 

13 from the genes for Aspergillus on/see TAKA amylase, Aspergillus nig&r glyeosnlylass, 
Aspergillus nldulms anthreniiate synthase, Fusmium oxysporum &ypsSn*fike protease, and; 
A$p®rp$m .o^eralpha-glucosidase. 

Useful polyadenylation sequences for yeast host colls are described by Gup and 
Sherman, 1995, Molecular Ce/Mar &/o/ogy 15; 5983-5990, 

20 The control sequence may also be a signal peptide coding region that codes for an 

amino acid sequence linked to the amino terminus of a polypeptide and directe the encoded 
polypeptide into the cell's secretory pathway. The 5- end of the coding sequence of toe 
nucleic acid sequence may Inherently contain a signal peptide coding region naturally linked 
In translation reading frame with the segment of the coding region which encodes the 

25 secreted polypeptide. Alternatively, the 5 s end of the coding sequence may contain a signal 
peptide coding region which is foreign to the coding sequence. The foreign signal peptide 
coding region may be required where the coding sequence does not naturally contain a 
signal peptide coding region. Alternatively, the foreign signal peptide coding region may 
simply replace the natural signal peptide coding region In order to enhance secretion of the 

30 polypeptide. However, any signal peptide coding region which directs the expressed 
polypeptide into the secretory pathway of a host ceil of choice may be used in the present 
invention.. 

Effective signal peptide coding regions for bacterial host ceils are the signal peptide 
coding regions obtained from the genes for Bacillus NClB 11837 maitogenic amylase, 
35 Bacillus stmtolftmmepMm alpha-amylase, Bacillus :Mmffi>tmi$ suhtlssn, Bacillus 
Ikiherfiformss alpha-amylase. Bacillus smmtrmrmophiim neutral proteases (nprT, nprS, 
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nptU), and Bacillus, mbiilis prsA, Further signal peptides are described by Stmonan and 
Palva, 1993, Microbiological Reviews ST: 109-137. 

Effective signal peptide coding regions for filamentous fungal host ceils are the signal 
peptide coding regions obtained from tie genes for AspergHius oryzae TAKA amylase, 
5 Aspergillus niger neutral amylase, Aspergillus niger glucoamylase, Rhizomucor rmWei 
aspartic proteinase, Huffismi& InSQi&ns dejlulase, and Humlmla lanuginosa lipase. 

Useful signal peptides for yeast host cells are obtained from the genes for 
Sacchammyc&s csravmlae alpha-factor and SaccPammyces mm/lssm invertase. Oilier 
useful signal peptide coding regions ate described by Romanes etaL, 1992, supra, 
to The control sequence may also be a propeptide coding region that codes for an 

amino acid sequence positioned at the amino terminus of a polypeptide. The resultant 
polypeptide Is known as a proenzyme or propoiypaptide (or a zymogen in some cases), A 
propolypepiide is generally inactive and can be converted to a mature active polypeptide by 
catalytic or autoeataiytie cleavage of the propeptide from the propolypeptide. The propeptide 
15 coding region may be obtained item the genes for Bacillus subtllis alkaline protease (ap?E} ( 
Bacillus subtilis neutral protease (nprT), SaeshammycBS oemvkkw alpha-factor, jRftlmmucar 
miene/asparfic proteinase, and Myeeiiopbiham thermophiia iaecase (WO 95/33838), 

In a preferred embodiment, the propeptide coding region is nucleotides 1493 of SEQ 
ID NO: i which encode amino acids -188 to --1 of SEQ ID NO: 2, nucleotides 82-576 of SEQ 

29 IP NO; 7 which encode amino adds -185 to -1 of SEQ ID NO: 8, nucleotides of : SHO 
ID NO: 9 which encode amine adds -188 to ~1 of SEQ ID NO: 10, and nucleotides 1-501 of 
SEQ 10 NO: 1 1 which encode amino acids -18? to ~1 of SEQ ID NO: 12. 

in -a preferred embodiment the signal peptide coding region is nucleotides 1-81 of 
SEQ ID NO: 7 which encode amine acids 1-20 of SEQ ID NO; 8 : or nucleotides 1-87 of SEQ 
2S ID NO; 9 which encode amino acids 1-29 of SEQ ID NO: 1 9. 

Where both signal peptide and propeptide regions are present at the amino terminus 
of a polypeptide, the propeptide region is positioned next to the amino terminus of a 
polypeptide and the signal peptide region is positioned next io the amino terminus of the 
propeptide region, 

30 ft may also he desirable to add regulatory sequences which allow the regulation of 
the expression of the polypeptide relative to the growth of the host cell, Examples of 
regulatory systems are those which cause the expression of the gene to he turned on or off 
in response to a chemical or physical stimulus. Including the presence of a regulatory 
compound. Regulatory systems lh pmferyofjc systems include the iac, tac, and trp operator 

35 systems. In yeast; the ADH2 system or GAL1 system may be used, In filamentous fungi, the 
TAKA alpha-amylase promoter, Aspergillus m$Br giucoamylase promoter; and Aspergillus 
oryzae giucoarnyiaso promoter may be used as regulatory sequences. Other examples of 



regulatory sequences are those which alow for gene amplification. In eukaryotie systems,, 
these include the dlhydttrfolste reductase gene which is amplified in the presence of 
methotrexate, and the metaliothionein genes which are amplified with heavy metals. In these 
cases, the nucleic acid sequence encoding the polypeptide would be operabiy linked with the 
5 regulatory sequence. 

Expression Vectors 

The present invention also relates to recombinant expression vectors comprising a 
nucleic acid sequence of the present ihveniioOi a promoter, and transcriptional and 

10 translates! stop signals. The various nucleic acid and control sequences described above 
may be joined together to produce a recombinant expression vector -which may Include one 
or more convenient restriction sites to allow for insertion or substitution of the nucleic: acid 
sequence encoding the polypeptide at such sites. Alternatively, the nucleic acid sequence of 
the present invention may be expressed by inserting the nucleic acid sequence or a nucleic 

is acid construct comprising the sequence Into an appropriate vector for expression. In creating 
the expression vector, the coding sequence is located in the vector so that the coding 
sequence is operabiy linked with the appropriate control sequences for expression. 

The recombinant expression vector may be any vector (e.g., a plasmid or virus) which 
can be conveniently subjected to recombinant DMA procedures and can bring about the 

20 expression of the nucleic acid sequence. The choice of the vector will typically depend on the 
compatibility of the vector with the host ceil into which the vector is to be introduced. The 
vectors may be linear or closed circular plasmids. 

The vector may be an autonomously replicating vector, /,e.< a vector which exists as 
an extrachromosornal entity, the replication of which is independent of chromosomal 

25 replication, e.g., a plasmid, an extrachromosomal element, a rninlchramosorne, or an artificial 
chromosome. The vector may contain any means for assuring self-replication. Alternatively, 
the vector may be one which, when introduced Into the host call, is integrated into the 
genome and replicated together with the chromosome{s) Into which it has been integrated. 
Furthermore, a single vector or plasmid or two or more vectors or plasmids which together 

30 contain the total DMA to be introduced into the genome of the host cell, or a transposon may 
be used. 

The vectors of the present invention preferably contain one or more selectable 
markers which permit easy selection of transformed cells, A selectabie marker Is a gene the 
product of which provides for biocide or viral resistance, resistance to heavy metals, 
35 proiotrophy to autotrophs, and the ilka. Examples of bacterial selectable markers are the da/ 
genes from Bacillus $uMstfs or Bso#ua fmh&mformls, Suitable markers for yeast host cells are 
ADE2, HIS3, LEU2, LYS2 S MET3, TRF1 , and URA3. Selectable markers for use In a 



filamentous fungal host cell Include, but are not limited to, smdS Cacefamidase), argB 
(ornithine cartoamoyitoansfBmse), Mr Iph^pfitoothiieto acetyltransferase), hygS (bygromycin 
phosphotransferase}, niaD (nitrate rsdudase},. pyrQ : <oro#3fe^-#osphMe decarboxylase), 
sC (sulfate adanyltranefera&e), trp£ .(s^thiBftitete synthase), as well as equivalents thereof. 
S Preferred for use in an A$pa®®u$ #®« are the amds and pyr© genes of Aspergiikm nkfulam 
or Aspesyilkm wyme and the Mr gene of Stmpfomycm hygimcapieos, 

The vectors of the present Invention pteferabiy contain an elements} that permits 
stable integration of the vector Into the host cell's genome or autonomous replication of the 
vector in fie ceil independent of the genome. 

10 For integration Into the host cell genome, the vector may rely on the nucleic acid 

sequence encoding the polypeptide or any other element of the vector for stable integration 
of the vector into the genome by homologous or nonhomologous recombination. 
Alternatively, the vector may contain additional nucleic acid sequences for directing 
integration by homologous recombination into the genome of the host call. The additional 

15 nucleic acid sequences enable the vector to be Integrated into the host cell genome at a 
precise location(s) in the chromosome(s). To increase the likelihood of integration at a 
precise location* the integrations! elements should preferably contain a sufficient number of 
nucleic acids, such as 100 to 1,600 base pairs, preferably 400 to 1,500 base pasnsyand most 
preferably 800 to 1,500 base pairs, which are highly homologous with the corresponding 

20 target sequence to enhance the probability of homologous recombination. The Integratlonai 
elements may be any sequence that is homologous with the target sequence In the genome 
of the host cell. Furthermore, the infegrateal elements may be non-encoding or encoding 
nucleic acid sequences. On the other hand, the vector may be integrated Into the genome of 
the host call by hoh-homologous recombination, 

25 For autonomous replication, the vector may further comprise ah origin of replication 

enabling the vector to replicate autonomously In the host ceil in question, Examples of 
bacterial origins of replication are the origins of replicate of plasmids p8R322, pUC19, 
pAGYC177, and pACYC184 permitting replicate in £ coif, and pUBIIO, pE194, pTA1060, 
and pAfylftl permitting replication in Bacillus. Examples of origins of replication for use in a 

•30 yeast host eel! are the 2 micron origin of replication, ARS1, ARS4, the combination of ARS1 
and CEN3, and the combination of m$4 and €B46. The origin of replicate may be one 
having a mutation which makes it fypefensng temperatwe-sensitlve In the host ceil (see, e.g., 
Ehriich, 1978, Proceedings of the National Academy of Sciences USA 76; 1433). 

More than one copy of a nucleic acid sequence of the present Invention may be 

3S inserted Into the H0st^fttoihc^^:p^uciK5 of ,f>e gene product, An increase In .the copy 
number of the nucleic acid sequence can be obtained by integrating at least one additional 
copy of the sequence Into the host celt genome or by including an amplifiabie selectable 



marker gene with dhe nucleic acid sequence where cells containing amplified copies of the 
selectable marker gene, and thereby additional copies of the nucleic acid sequence, can be 
selected lor by cultivating the cells in the presence of the appropriate selectable agent, 

The procedures used Ic ligafe trie elements described above to construct the 
5 recombinant expression vectors of the present invention are vyafi known to one skilled in the 
art (see, e.g.., Sarnbroekei al, 1089, supra). 

The protease may also be co-expressed together with at least one other enzyme of 
interest for animal feed, such as aipha-amyiase (EG 3.2,1.1), phytase (EC 3.1,3.8 or 
3.1.3.28); xylanase (EC 3.2.18); gaiacfariase (EC 3.2,1,89); alpha-galactosfdaae (EC 

10 3.2.1.22); protease {EC 3.4.-."}, phosphatase Al {EC 3.1.1.32); phosphatase A2 (EC 
3.1.1.4); iysopbesphotipase (EC 3,11.5); phosphatase C (3,1.4.3); phospholipase D (EC 
3.1.4.4); and/or beta-gluoanase (EC 3,2-14 or EC 3.2,18). 

The enzymes may be co-expressed from different vectors, from one vector, or using a 
mixture of both techniques. When using -different vectors, the vectors may have different 

15 selectable markers, and different origins of replication. When using only one vector, the 
genes can be expressed from one or more promoters, if cloned under the regulation of one 
promoter (di- or muithelstronic}, the order in which the genes are cloned may affect the 
expression levels of the proteins. The protease may also be expressed as a fusion protein, 
i.e. that the gene encoding the protease has been fused in frame to the gene encoding 

20 another protein. This protein may be another enzyme or a functional domain from another 
enzyme, 

Host Cells 

The present invention also relates to recombinant host cells, comprising a nucleic 
2S acid sequence of the invention, which are advantageously used in the recombinant 
production of the polypeptides, A vector comprising a nucleic acid sequence of the present 
invention is introduced info a host cell so thai the vector Is maintained as a chromosomal 
integrant or as a self replicating extra-chromosomal vector as described earlier. The term 
"host ceil" encompasses any progeny of a parent cell that Is not identical to the parent ceil 
30 due to mutations that occur during replication. The choice of a host cell will to a large extent 
depend upon the gene encoding the polypeptide and its source. 

The host cell may he a uniceilular mleroorgenism, e.g., a prokaryote, or a non- 
unicellular microorganism, e.g., a eukaryote. 

Useful uniceilular ceils are bacterial cells such as gram positive bacteria Including, but 
SS not limited to, a Bacfflus cell, or a Stmplomys&s eeiv or ceils of lactic acid bacteria; or gram 
negative bacteria such as E GQflm$!^midomQftmw< lactic acta bacteria Include, hut are 
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not limited to, species of the Qmma- LmGtommu^ La<MhacMu$, L&UGonasivc, Simp- 
iococaus, Padhmccus, and Bnl&mcocctis. 

The Introduction of a vector Mo & bacterial host ceil may, for Instance, be effected by 
protoplast transformation (see, e.g„ Chang and Cohen, 1979, Molecular General Genetics 
5 168: 111-11% using competent cells {ses, ,e>g,, Young and Spteizin, 1961, Journal of 
Bacteriology 81: 823-829, or Dubnay and DavldomAbateon, 1971 , Journal of Molecular 
Biology 56: 209-221), eteciropofatksn (see, e.g., Snsgekawa and Dower, 1988, Blotechnlques 
8: 742-751 }, or conjugation (see, e,g. s Koehier and Thome, 1987, Journal of Bacteriology 
169:5771-5278), 

to The host cell may be a eukaryote, such as a non-human animal cell, an Insect call, a 

plant cell or a fungal cell 

In one particular embodiment; the host cell is a fungal cell. "Fungi" as used herein 

Includes the phyla Aseomyoofa, Basidsomycots, Chyfridlomycota, and Zygomycete (as 

defined by Hawksworth et at, In, Alnswodh and Bispy's Dictionary of The Fungi, 8th edition, 
15 1995, CAB International, University Press, Cambridge, UK) as well as the Oomycola (as 

cited in Hawksworth el a!., 1995, supra, page 171) and all mitosporic fungi (Hawksworth et 

al, 199S, supra), 

in another particular embodiment, the fungal host cell is a yeast ceil. "Yeast" as used 
herein Includes ascosporcgenous yeast (Endomycefaies), basldiosporogenous yeast, and 
M yeast belonging to the Fungi iniperfecti {Blastomyces), . 0lnice .^ ; ^««s|fic^pri of yeast 
may change In the future, for the purposes of this invention, yeast shall be defined as 
described In Biology end Activities of Yeast (Skinner, FA, Passmore, $M„ and Davenport, 
R.R., eds, Soc, App. Bacterid, Symposium Series No. 9, 1980). 

The yeast host celt may be a Canada, Hansmute, Kluyv®mmyc&$, Plclm, 
25 SaccMmmyo&s, SQhizosacQhBnimyG&s, or Varruma cell. 

The fungal host ceil may be a filamentous fungal cell. ''Filamentous fungi" Include all 
filamentous forme of the subdivision Eumyeoia and Oomycoia (as defined by Hawksworth et 
ai., 1995, supra). The filamentous fungi are characterized by a mycelial wall composed of 
chitin, cellulose, giucan, chltosan, mannan, add other complex polysaccharides. Vegetative 
30 growth is by hyphai elongation and carbon oataboiism is obllgafely aerobic. In contrast, 
vegetative growth by yeasts such as Sacchammyces eerewsfae Is by budding of a unicellular 
thailus and carbon oataboiism may be fermentative. 

Examples of filamentous fungal host cells are cells of species of, but not limited to, 
Acmmonkim, Aspergillus, Fmatlum, Humicola, Mooor; Myawophthom, Neumspom, 
3S Pmltitthtm, Thi&lavm, Tolypo&iadkm.Qr TiMmdmm. 

Fungal cells may be transformed by a process involving protoplast formation, 
transformation of the protoplasts, and regeneration of the ceil wall in a manner known perse. 
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Suitable procedures for transfoft^atioo of Aspergillus host calls are described In BP 238 023 
and Yaftoh et a!,, 1984, Proceedings of -'the National Academy of Sciences USA 81; 1.470- 
1474- Suitable methods fertraosfarffiing: f mamm species are described by Ivtaiardier et a!. ,, 
1989, <3&m 78; 147-158 and WO 98/00787. Yeast may be transformed using the procedures 
5 described by Becker and Guamnte,. in AheiseOi J.N. and Sinien, MX, editors, Guide to Yeast 
Genetics and Molecular Biology, Methods in Enzymology, Volume 194, pp 182-187, 
Academic Press, Inc., New York; Vfo ei at, 1988, Journal of Bacteriology 183; 183; and 
Htnnen et al, 1978, Proceedings of the National Academy of Sciences USA 75: 1920. 

10 Methods of Production 

The present invention aiso relates to methods for producing a polypeptide of the 
present Invention comprising (a) cultivating a strain, which in its wild-type form is capable of 
producing the polypeptide; and (b) recovering the .polypeptide. Preferably,, the strain is of the 
genus ^ocard/opsfe, more preferably Nacwdtopsis dassonvifiel Nocardkpm alba, or 
15 NoGartfiopsis antamtioa, more preferably /vbcard/ops/s damoovfflei mhsp, 0msonv$aL 

The present Invention also relates to methods for producing a polypeptide of the 
present invention comprising (a) cultivating a host eel! under conditions conducive for 
production of the polypeptide; and (b) recovering the polypeptide. 

The present Invention aiso relates to methods for producing a polypeptide of the present ■■ 

20 invention comprising (a) cultivating a host cell under conditions conducive for production of 
the polypeptide, wherein the host cell comprises a mutant nucleic acid sequence comprising 
at least one mutation In nucleotides 499-1062 of SEQ ID MO; 1, 577-1184 of SEQ ID NO; 7, 
586-1152 of SEO ID MO; 9, and/or 502-1068 of SEQ ID NO; 1 1 , in which the mutant nucleic 
acid sequence encodes a polypeptide which $ consists of amino acids 1 to 188 of SEQ ID 

25 NO; 2, 1 to 1 88 of SEQ ID NO; 8, 1 to 1 09 of SEQ ID NO: 1 0, and/or 1 to 1 88 of SEQ ID NO: 
12, or (ii) is a variant of any of the sequences of (I), wherein the variant comprises a 
substitution, deletion, and/or Insertion of one or more amino adds, or (lis) is an allelic variant 
Of any of the sequences of {{), or (iv) is a fragment of any of the sequences of (I). 

In the production methods of the present invention, the cells are cultivated in a 

30 nutTteni medium suitable for production of the polypeptide using methods known in the art 
For example, the ceil may be cultivated by shake flask cultivation, small-scale or large-scale 
fermentation (including continuous, batch, fed-batch, or solid state fermentations} in 
laboratory or industrial tormentors performed In a suitable medium and under conditions 
allowing the polypeptide to be expressed and/Or Isolated. The cultivation takes place In a 

35 suitable nutrient medium comprising carbon and n'^^'se^rt^-wl. inorganic salts, using 
procedures known in the art. Suitable: media are available from commercial suppliers or may 
be prepared according to pubffcfted: .c^pd«IS#is.-'<ag,, M .catalogues of the American Type 
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Culture Collection), If the polypeptide is secreted info the nutrient medium, the polypeptide 
can be recovered directly fern the medium, if the polypeptide is not secreted, It can be 
recovered from eeii lysafes. 

The polypeptides may he detected using methods known Jh the art that are specific 
5 for the polypeptides. These detection methods may Include use of specific antibodies, 
formation of a product, or disappearance of a substrate, For example, a protease assay may 
be used to determine the activity of the polypeptide as described herein. 

The resulting polypeptide may he recovered by methods known in the art. For 
example, the polypeptide may bo recovered, from: the nutrient medium by conventional 
10 procedures including, but not limited to, centrifugation, filtration, extraction, .spray-drying, 
evaporation, or precipitatiom 

The polypeptides of the present invention may he purified by a variety of procedures 
known in the art including, hut not iimited tor ohromatog?Bphy (e.g., Ion exchange, -affinity, 
hydrophobic, ehromafofocusirKp and size exclusion}, efectrophoretie procedures (e.g., 
IS preparative Isoelectric focusing), differential solubility (eg., ammonium sulfate precipitation}, 
-SDS-PAGE, or extraction (see, e.g.. Protein Purification, J.-C. Janson and lars Pydep, 
editors, VCH Publishers, Maw York, 1980), 

Plants 

20 The present Invention also relates to a transgenic plant, plant part, or plant set! which 

has been transformed with a nucleic acid sequence encoding a polypeptide having protease 
activity of the present invention so as to express and produce the- polypeptide In recoverable 
quantifies. The polypeptide may he recovered from the plant or plant part. Alternatively, the 
plant or plant part containing the recombinant polypeptide may be used as such for 

25 improving the qualify of a food or feed, eg., improving nutritional value, paiatabity, and 
rheoiogical properties, or to destroy an pniinuiritive factor. 

In a particular embodiment, the polypeptide Is targeted to the endosperm storage 
vacuoles in seeds. This can he obtained by synthesizing it as a precursor with a suitable 
signal peptide, see Horvath el a! in PNAS, Fob, 15, 2000, vol 97, no. 4, p, 1914-1919, 

30 The transgenic plant can he dicotyledonous (a dicof} or monocotyledonous (a 

monocot} or engineered variants- thereof. Examples of monocot plants are grasses, such as 
meadow grass (blue grass, Poa), forage grass such as Festoea, Lo&ro, temperate grass, 
such as Agrosfis, and cereals, e,g., wheat Oats, rye, barley, rice, sorghum, thticaie 
(stabilized hybrid of wheat (Tricimm) and rye {SeeaM}, and maize (corn). Examples of dicot 

3-5- plants are tobacco, legumes, such as sunflower (Helmnthus), cotton (Gossypium) lupins, 
potato, sugar beef, pea, bean and soybean, and cruciferous plants (family Brassicsceaej, 
such as cauliflower, rape seed, and the closely related model organism MmhkS&psis ihalmm. 
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Low~phytate plants as described e.g., in US patent no, 5y689,054 and US patent no. 
8,1 i 1,188 are examples of engineered plants. 

Examples of plant parts am stem, callus, leaves, root, fruits, seeds, and tubers, as 
well as the individual tissues comprising these parts, e.g. eplderrms, mesophyli. parenchyma, 
§ vascular tissues, nierlstams. Also sfcesffte plant ceil compartments, such as chloroplast, 
apaplasi mitochondria, vacuole, peroxisomes, and cytoplasm are considered to be a plant 
part. Furthermore, any plant coll, whafevwihe tissue origin, Is considered to be a .plant part 
Likewise, plant parts such as specific tissues and cells Isolated to facilitate the utilisation of 
the invention are also considered plant parts, e^j. embryos., endosperms, aleurone and seed 
10 coats. Also Included within the scope of the present invention are the progeny of such plants, 
plant parts and plant cells, 

The transgenic plant or plant cell expressing a polypeptide of the present invention 
may he constructed In accordance with methods known in the art. Briefly, the plant or plant 
cell is constructed by incorporating one or more expression constructs encoding a 
1 5 polypeptide of the present invention into the plant host genome and propagating the resulting 
modified plant or plant cell into a transgenic plant or plant celt 

Conveniently, the expression construct is a nucleic acid construct which comprises a 
■nucleic acid sequence encoding a polypeptide of the present invention oparably linked with 
appropriate regulatory sequences required for expression of the nucleic acid sequence in the 
20 plmt or plant part of choice. Furthermore, the expression construct may comprise a 
Selectable marKer useful for identifying host ceils into which the expression construct has 
been Integrated and DMA sequences necessary for introduction cf the construct into the 
plant in question (the latter depends on the DMA Introduction method to be used). 

The choice of regulatory sequences, such as promoter and terminator sequences and 
2S optionally signal or transit sequences are determined, for example, on the basis of whan, 
where, and how the polypeptide is desired te fee expressed. For instance, the expression of 
the gene encoding a polypeptide of the present invention may be constitutive or inducible, or 
may be developmental, stage or tissue specific, and the gene product may be targeted to a 
specific tissue or plant part such as seeds or leaves. Regulatory sequences are, for example, 
30 described by Tague at ai , i 988, Plant Physiology 88: 808. 

For constitutive expression, the following promoters may be used: The 35S~CaMY 
promoter (Franck at al. s 1980, Call 21: 285-294), the maize ubiqultln 1 (Christensen AH, 
Sharrock RA and Quail 1992. M*am polyuhiquitin genes; structure f thermal perturbation of 
expression and transcript spiicing, and promoter activity following transfer to protoplasts by 
3§ electroporaiiorO, or the rice aetin 1. promoter {Plant Mo, Biol. 18, 87S-88&: Ihang W, McElroy 
D. and Wu R 1991, Analysis of rioe Aotl 6* region activity In transgenic rice piants. Plant Ceil 
3, 11 55-1185). OrgarMSpecifie promoters may be f for example, a promoter from storage Sink 
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tissues such as seeds, potato tutors, and fruits {Edwards & Coruzsl 1990, Ann, Rev. Genet 
24: 275-303), or from metabolic sink tissues such as mensfems po et at, 1894, Plant Mol. 
Biol, 24: 863-878), a seed specific promoter such as #58 giutelin, protarnln, globulin, or 
•albumin promoter from rice (Wu etal, 1888, Plant and Call Physiology 39: 885-889), a V&ia 
s feba promoter from the legurnin 84 and the unknown seed protein gene from Vicia fata 
{Conrad at ah, 1998, Journal of Plant Physiology 152: 708-71 1), a promoter from a seed oil 
body protein (Chen etal., 1998, Plant and Gail Physiology 39: 835-941), the storage protein 
napA promoter from Srmstes mpus t or any other seed specific promoter known in toe art, 
e.g., as described in WO 91/14772. Furthermore, the promoter may be a leaf specific 
10 promoter such as the rbcs promoter from rice or tomato (Kyozuka ei at, 1993, Plant 
Physiology 102; 981-1000, the chioreiia virus adenine methyltransferase gene promoter 
(fvfifra and Higgles, 1994, Plant Molecular Biology 26: 85-93), or the aldP gene promoter from 
rice (Kagaya et a!,, 1995,. Molecular and General Genetics 248: 868-874), or a wound 
inducible promoter such as fie potato pm2 promoter {Xu et at, 1993, Plant Moiecular Biology 
15 22: 573-588). likewise, the promoter may be inducible by abiotic treatments such as 
temperature, drought or alterations In salinity or inducible by exogenously applied 
substances that activate the promoter, e,g. ethane!, oestrogens, plant hormones like 
ethylene, absetsle acid, glbberellie acid, and/or heavy metals. 

A promoter enhancer element may also be used to achieve higher expression of tee 
20 protease in the plant. For instance, the promoter enhancer element may be an infron which is 
placed between the promoter and the nucleotide sequence encoding a polypeptide Of the 
present invention. For instance, Xu et at, 1993, supra disclose the use of the first ihtron Of 
the rice actin 1 gene to enhance expression, 

Still further, the codon usage may be optimized for the plant species in question to 
2§ Improve expression (see Horvath et ai referred to above), 

The selectable marker gene and any other pans of the expression construct may be 
chosen from those available in the art. 

The nucleic acid construct is incorporated info the plant genome according to 
conventional techniques known in the art, Including AgmOacfef^m-mediafed transformation, 
SO virus-mediated transformation, microinjection, particle bombardment, hloiistic transformation, 
and electroporatbn {Gasseretat, 1990, Science 244: 1293; Potrykus, 1990, Bio/Technology 
8: 533; Shimamoto et at, 1989, Natere 338: 274). 

Presently, Agmbacierium temefacfe/Ts-mediated gene transfer is the method of 
choice for generating transgenic dicois (for a review, see Hooykas and Schllperoork 1962, 
35 Plant Molecular Biology 19; 16-38), and it can also be used for transforming monocots, 
although other transformation methods are generally preferred for these plants, Presently, 
the method of choice for generating transgenic monocots, supplementing the Agrobacteflum 
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approach, is particle bombardment hYUcroscopie gold or fuhgste-n pedicles coated with the 
irarssformfng- DNA) of smbryonle call! or developing embryos {Ghrisfou, 1992, Plant Journal 
2: 275-281; Shimamoto, 1854, Current Opinion Biotechnology 5: 158-182; Vasll at aL, 1992, 
Bio-Technology 10; 887-674), An alternative metiod for transformation of monocots Is based 
S on protoplast traosfomiaiJon as described by Grnirulleh et al t i§88> Plant Molecular Biology 
21:415-428. 

Following transformate, the transforrhants haying Incorporated therein the 
expression construct are selected and regenerated Into whole plants according to methods 
well-known in the art, 

10 The present invention also relates to methods for producing a polypeptide of the 

present Invention comprising (a) cultivating a transgenic plant or a plant eel! comprising a 
nucleic add sequence encoding a polypeptide having, protease activity of the present 
Invention under conditions conducive tor production of the polypeptide; and (b) recovering 
the polypeptide, 

15 

Animais 

The present invention also relates to a transgenic, non-human animal and products or 
elements thereof; examples of which are body fluids such as milk and blood, organs, flesh, 
and animal ceils. Techniques for expressing proteins, e>g. in mammal fan ceils, are known In 

20 the art, sea e.g. the handbook. Protein expression; A Practical Approach, Higgins and Names 
(em, Oxford University Press (1999), and the three other handbooks In this series relating to 
Gene Transcription, RMA processing, and Poshtransiatlonal Processing. Generally' speaking 
to prepare a transgenic animal, selected ceils of a selected anlmai are transformed with a 
nucleic acid sequence encoding a polypeptide, having protease activity of the present 

25 invention so as. to express and produce the polypeptide. Hie polypeptide may he recovered 
from the animal, e.g. from the milk of female animals, or the polypeptide may be expressed 
to the benefit of the animal itself, e.g, to assist the animal's digestion. Examples of animais 
are mentioned below in the section headed Animal Feed. 

To produce a transgenic animal with a view to recovering the protease from the milk 

3d of the animal, a gene encoding the protease may be inserted into the fertilised eggs of an 
animal in question, e.g. by use of a transgene expression vector which comprises a suitable 
milk protein promoter, and the gene ehcoding the protease. The transgene expression vector 
Is microinjecied Into fertilized eggs, and preferably permanently integrated into the 
chromosome, Once the egg begins to grow and divide, the potential emhryo is implanted into 

'35 a surrogate mother, and animals carrying the transgene are Identified. The resulting animal 
can men he multiplied by conventional breeding. The polypeptide may be purified from the 
animal's milk, see e,g. Meade, H.M. ef at (1999): Expression of recombinant proteins in the 



milk of transgenic animals, Gene expression systems: Using nature for the art of expression. 
J. M. Fernandez and J. R-;Hpil^ : <edjS.}» : A^demfe-.Pm^ 

In the alternative, in order to produce a transgenic non-human animal that carries in 
the genome of its somatic and/or germ ceils a nucleic acid sequence including a 
5 heterologous transgen® construct Including a transgene encoding the protease, the 
transgene may be operably Jinked to a first regulatory sequence for salivary gland specific 
expression of the protease, as disclosed in WO 00/0842^7. 

Compositions 

10 In a still further aspect the present invention relates to compositions comprising a 

polypeptide of the present invention. 

The polypeptide compositions may be prepared in accordance with methods known 

in the art and may he in the form of a liquid or a dry composition. For instance, the 

polypeptide composition may be in the form of a granulate or a rnlorogranuiafe. The 
is polypeptide to he included in the composition may be stebfeed In accordance with methods 

known in the art, 

Examples are given below of preferred uses of the polypeptides or polypeptide 
compositions of the Invention, 

20 Animal Feed 

The present invention is also directed to methods for using the polypeptides of the 
invention in animal feed, as- well as to feed compositions and feed addifves eomphsirig the 
polypeptides of the invention. 

The term animal includes ell animals. Including human beings. Examples of animals 
25 are non-ruminanis, and ruminants. Ruminant animals include, for example, animals such as 
sheep, goats, horses, and cattle, e.g. beef cattle, eows f and young calves. In a particular 
embodiment, the animal Is a non-ruminant animal Mon~rumsnant animals include mono- 
gastric animals, e.g. pigs or swine (including, but not limited to, piglets, growing pigs, and 
Sows); poultry such as turkeys, ducks and chicken (including but not limited to broiler chicks, 
30 layers); young calves; and fish (including hut not limited to salmon, trout, tiiapia, catfish and 
carps; and crustaceans {including hut not limited to shrimps and prawns). 

The term feed or feed composition means any compound, preparation, mixture, or 
composition suitable for, or intended for intake fey an animal. 

in the use according to the Invention die protease can he fed to the animal before, 
3S after, or simultaneously with the diet The latter is preferred. 

in a particular embodiment, the protease, in the form in which it is added to the feed, 
or when being included in a feed additive, is welNMined, Well-defined means that the 
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protease preparation is at least 30% pure as determined by Sfee-exeiusion chromatography 
(see Example 12 of WO 01/58275), In other particular -embodiments the protease preparation 
is at least §0, TO, 80, BS S OS, 90, 92, 94, or at least 96% pure as determlnad by this method. 
A well -defined protease preparation Is advantageous. For Instance, it Is much easier 
S to close correctly to the feed a protease that Is essentially free from Interfering or 
contaminating other proteases. The term dose correctly refers In particular io the objective of 
obtaining consistent and constant results, and the capability of optimising dosage based 
upon the desired effect. 

For the use in animal feed, however, the protease need not be that pure; It may e.g. 
10 include other enzymes, in which case It could fee termed a protease preparation. 

The protease preparation can be (a) added directly to the feed (or used directly in a 
treatment process of proteins), or (b> it can be used In the production of one or more 
Intermediate compositions such as feed additives or premixes that is subsequently added to 
the feed (or used In a treatment process). The degree of purity described above refers to the 
1 5 purity of the original protease preparation, whether used according to (a) or (b) above. 

Protease preparations with purities of this order of magnitude are in particular 
obtainable using recombinant methods of production, whereas they are not so easily 
obtained and also subject to a much higher batdvto-bateh variation when the protease is 
produced by traditional fermentation methods. 
20 Such protease preparation may -of course be mixed with other enzymes* 

In a particular embodiment, the protease for use according to the invention is capable 
of soiublilsfng proteins. A suitable assay for determining solubllised protein Is disclosed in 
Example 3. 

The protein may be an animal protein, such as meat and bone meal, and/or fish meal; 

is or it may be a vegetable protein. 

The term vegetable proteins as used herein refers to any compound, composition, 
preparation or mixture that includes at least one protein derived from or originating from a 
vegetable, including modified proteins, and protein-derivatives, in particular embodiments, the 
protein content of the vegetable proteins is at least 10, 20, 30, 40, 50, or §0% (w/w), 

30 Vegetable proteins may be derived from vegetable protein sources, such as legumes 

and cereals., for example materials from plants of the families Fabacem (Legumimsm^ 
Cnscifemoeae, Chenopodiaceae, and Poac&m, such as soy bean meal, lupin meal and 
rapeseed meal. 

In a particular embodiment, the vegetable protein source is material from one or more 
35 plants of the family Fabaceae, e.g. soybean, lupine, pea, or bean. 

In another particular embodiment, the vegetable protein source is material from one 
or more plants of the family £herwp0<$aceae, e.g. heet> sugar beef, spinach or qulnoa. 



Other examples of vegetable protein sources are rapeseed, sunflower seed, cotton 
seed, and cabbage. 

Soybean Is a preferred vegetable protein source. 

Other examples of vegetable protein sources are cereals such .as barley, wheat, rye. 
8 oat, make (com), flee, trlicale, and sorghum. 

The treatment according to the invention of profete with at least one protease of the 
invention results in an Increased solubllisaflon of proteins. 

The following are examples of % soiuhised protein obtainable using the proteases of 
the Invention in a monogastrfc to vitro model: At least 102%, 103%, 104%, 105%, 108%, or 

10 at least 107%, relative to a blank. The percentage of soiubiilsed protein is determined using 
the monogastrie to vitro model of Example 3 and/or Example 10. The term solubiiisatlon of 
proteins basically means bringing protein(s) Into solution. Such solubilization may be due to 
protest mediated release of protein from other components of the usually complex natural 
compositions such as feed. 

15 In a further particular embodiment the protease for use according to the invention lis 

capable of increasing the amount of digestible proteins. The following are examples of % 
digested or digestible protein obtainable using the proteases of the invention in a 
monogastrie to vitro model: At least 104%, 105%, 108%, 107%, 108%, 109%, 110%, 111%, 
112%, 113%, 1 14%, 115%, or at least 1 18%, relative to a blank. The percentage of digested 

20 or digestible protein Is determined using the to vitro model of Example 3 and/or Example 10, 
The following are examples of % digested or digestible protein obtainable using the 
proteases of the Invention in an aquacultura fa vitro model: At feast 103%, 104%, 105%, 
106%, 107%, 108%, 109% of at least 110%, relative to a blank. The percentage of digested 
or digestible protein is determined using the aquacalture to vitro model of Example 4. 

£5 in a still further particular embodiment, the protease for use according to the invention 

Is capable of increasing the Degree of Hydrolysis (DH| of proteins. The following are 
examples of Degree of Hydrolysis increase obtainable in a monogastrie to vitro model: At 
least 102%, 103%, 104%, 105%, 108%, or at least 107%, relative to a blank. The DH is 
determined using the monogastrie in vitro model of Example 3. The following are examples 

30 of Degree of Hydrolysis increase obtainable in an aquaeuHwe in vitro model: At least 102%, 
103%, 104%, 105%, 108%, or at least 107%, relative to a blank. The DH is determined using 
the aqusculture to vitro mode! of Example 4. 

In a particular embodiment of a fore-} testmeot process of the Invention, the 
profeasa(s) In question Is affecting for aclng cn, or exerting its soiubtiising influence on) the 

35 proteins or protein sources. To achieve this, the protein or protein source is typically 
suspended In a solvent, e.g. an aqueous solvent such as water, and the pH and temperature 
values are adjusted paying due regard to the characteristics of the enzyme in question. For 



example, the treatment may take place at a pH-vaiua at which the activity of the actual 
protease is at least at least 48%, 80%, 80%, 70%, 80% or at least 80%, Likewise, for 
example, the treatment may take place et a temperature at which the activity of the actual 
protease is at least 40%, 50%, 80%, 70%. 80% or ai least 90%. The above percentage 
s activity indications are relative to the maximum activities. The enzymatic reaction is 
continued until the desired result is achieved, following which it may or may not he stopped 
by inactivating the enzyme, e,gj by a heat-ireatent step. 

In another particular embodiment of a treatment process of the Invention, the 
protease action Is sustained, meaning e.g. that the protease is added to the proteins or 
10 protein sources, hut its solubllising influence is so: to speak not switched on until later when 
desired, once suitable solublfising conditions are established, or once any enzyme inhibitors 
ere Inactivated, or whatever other means could have been applied to postpone the action of 
the enzyme, 

In one embodiment the treatment is a pre^reatment of animal feed or proteins for use 
1 5 in animal feed. 

The term improving the nutritional value of an animal feed means improving the 
availability and/or digestibility of the proteins, thereby reading to Increased protein extraction 
from the diet components, higher protein yields, Increased protein degradation andtof 
Improved protein utilisation. The nutritional value of the feed Is therefore increased, and the 

20 animal performance such as growth rate and/or weight gain and/or feed conversion ratio |i.e. 
the weight of ingested feed relative to weight gain) of the animal is/are improved. 

in a particular embodiment the feed conversion ratio Is increased by at least 1%, 2%, 
3%, 4%, 5%, 6%, 7%, 8% s 9% or at least 10%. iha further padlcuiar embodiment the weight 
gain is increased by at least 2%, 3%, 4%, 5%, 8%, ?%, 8%, 9%, 10% or at least 11%, These 

25 figures are relative to control experiments with no protease addition. 

The feed conversion ratio (PGR), and the weight gain may be calculated as described 
in EEC (1988): Directive de la Commission du 9 avril 1988 fixant la methode de ealcu! de la 
vateur enargetigua dee ailments composes destines a la volaiile, Journal Official des 
Communautes Europeennes, L130, 53 -84. 

30 The protease can be added to tie reed in any form, be it as a relatively pure 

protease;, or in admixture with other components Intended for addition to animal feed, he in 
the form of animal feed additives, such as the so-called pre~mixes for animal feed. 

In a further aspect the present invention relates to compositions for use in animal 
feed, such as animal feed, and animal feed additives, e.g; ©remixes. 

35 Apart from the protease of the invention, the animal feed additives of the Invention 

contain at least one fat-soluble vitamin, ahd/br at least ode water soluble vitamin* and/or at 
least one trace mineral The feed additive may also contain at least one macro mineral. 



Further, optional, feed-addife/e ingredients are colouring agents, e>g. carotenolds 
such as beta-carotene, asiaxanthln, and lutein; aroma compounds; ; siBhlllsers; antimicrobial 
peptides; polyunsaturated fa% acids; reactive oxygen generating species; and/or at least 
one other enzyme selected from amongst alpha-amylase (EC 3;2,1.1), phytase (EC 3.1,3.8 
5 or 3.1.3.26); xylanase (BO 3,2.18); galacianase (EC 3.2.1.8% aipha--gaiactosidase (EC 
3.2,1.22); protease (EC 3.4.-.-}, phosphollpase Ai (BO 3.1.1.32); phosphoilpase A2 (EC 
3.11,4}; iysophospheiipase (EC 3.1. IS); phosphoiipase C (3,1.4.3); phosphoilpase D (EC 
3.1 .4.4): amylase such as, for example, aipha^myiase (EC 3,2.1.1); and/or beta-glucanase 
(EC 3.2.1.4 or EC 3.2.1.6). 
10 In a particular embodiment these other enzymes are welklefined (as defined above 

for protease preparations). 

Examples of antimicrobial peptides (AMP's) are CAP1S, Leucodn A, Triirpticln, 
Prologon-I, Thanattn,- Defensim Lactoferrin, Lactofemetn, and Ovlspirln such as NovlspMn 
(Robed Lehrer, 2000), Pieetasins, and Statins, including the compounds and polypeptides 
is disclosed In WO 03/044049 and WO 03/048141, as wall as variants or fragments of the 
above that retain antimicrobial activity. 

Examples of antifungal polypeptides (AFP%) ere the Asp&rg$us gigantism, and 
A$p&rgiiim nsg&- peptides, as well as variants and fragments thereof which retain antifungal 
activity, as disclosed in WO 84/01459 and WO 02/090384. 
20 Examples of polyunsaturated fatty acids are C18, G20 and 022 polyunsaturated fatty 

acids, such as arachidonic add, docoeohexaenolc acid, eicosapentaenofe acid and gamma" 
linoieic acid. 

Examples of reactive oxygen generating species are chemicals such as perborate, 
persulphate, or percarhonate; and enzymes such as an oxidase, an oxygenase or a 
25 syntathase. 

ti sally fat- and water-soluble vitamins, as well as trace minerals form part of a so- 
called premix Intended for addition to the feed, whereas macro minerals are usually sepa- 
rately added to the feed. A premix enHched with a protease of the invention, is an example of 
an animal feed additive of the invention. 

30 In a particular embodiment, the animal feed additive of the invention is intended for 

being included (or prescribed as having to be Included) in animal diets or feed at levels of 
0.01 to 10,0%; more particularly 0.05 to 5,0%: or 0,2 to 1,0% (% meaning g additive per 100 
g feed). This is so in particular for premixes. 

The following are non-exclusive lists of examples of these components: 

38 Examples of fat-soluble vitamins are vitamin A, vitamin D3, vitamin E, and vitamin K, 

e.g. vitamin K3, 
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Examples of water-soluble vfiamlns afe vitamin 812, hSofin and choline, vitamin B1, 
vitamin B2, vitamin 88, niacin, folic acid and pantothenate, e.g. Ca-D-panthothsnate, 

Examples of trace minerals ore manganese, zinc, ton, copper, iodine, selenium, and 

cobalt. 

5 Examples of macro minerals are calcium, phosphorus and sodium. 

The nutritional requirements of these components '{exemplified with poultry and 
piglets/pigs} are listed In Table A of WO 01/58275. Nutrftienai requirement means thai these 
components should be provided in the diet in itio ooncohtraions indicated. 

In the alternative, the animal feed additive of the invention comprises at least one of 
10 the individual components specified in Table A of WO 01/58275. At least one means either 
of, one or more of, one. or ism s or threes or four and so forth up to ail thirteen, or up to all 
fifteen Individual components. ^Sore specifically, this, at least one individual component fs 
included in the additive of the invention In such an amount as to provide an in-feed- 
concentration within the range Indicated in column four, or column five, or column six of 
15 Tabie A. 

The present Invention also relates T to animal teed compositions. Animal teed 
compositions or diets have a relatively high content of protein. Poultry and pig diets can be 
eharaotensed as indicated In labia 8 of WO 01/58275, columns 2-3, Fish diets can be 
eharactedseci as indicated in column 4 of this Table 8, Furthenrsora audi fish diets usually 
20 have a crude fat content of 200 -310 g/kg, WO 01/58275 corresponds to US 09/779334 which 
is hereby incorporated by reference. 

An animal feed composition according to the invention has a crude protein content of 
50-800 g/kg, and furthermore comprises at least one protease as claimed herein. 

Furthermore, or in the alternative {to the crude protein content Indicated above), the 
25 animal feed composition of the Invention has a content of mefabolisabie energy of 10-30 
MMg; and/or a content of calcium of 0.1-200 g/kg; and/or a content of available phosphorus 
of 0.1-200 g/kg; am/or a content of methionine of 0,1-100 g/kg; and/or a content of 
methionine plus cysteine of 0.1-150 g/kg; and/or a confent of lysine of 0.5-50 g/kg. 

In particular embodiments, the content of metabolisabie energy, crude protein, 
30 calcium, phosphorus, methionine, methionine plus cysteine, and/or lysine is within any one of 
ranges 2, 3 , 4 or 5 in Table .8 of WO 01/58275 (R, 2*5). 

Crude protein Is calculated as nitrogen §i) multiplied by a factor 8.25, i.e. Crude 
protein (g/kg)~ N (g/kg) x 8.25. The nitrogen content Is determined by the Kjeldahl method 
(A.O.A.C., 1984, Official Methods of Analysis Uih ed„ Association of Official Analytical 
35 Chemists, Washington DC), 

Metabolisabie energy can be-calaataifed- m : #» feasts-of the NRC publication Nutrient 
requirements in swine, ninth revised edition 1988, subcommittee on swine nutrition, 



committee on animal nutrition, board of agricultural national research council. National 
Academy Press, Washington, D.G< t pp. 2*8, and the European Tabie of Energy Values for 
Poultry Feed-stuffs, Speldorhotf Gsntre for poultry -research and extension, 7381 DA 
Beekbergen, The Netherlands; Grafiseh bedrijf Ponsen SUoeijen bv, Wageningen. ISBN SO 
§ 71463-12-6- 

The dietary content of calcium, available phosphorus and amino acids In complete 
animal diets Is calculated on the basis of feed tables such ss Veevoedertabei 1997, 
gegavens over chemjssha samenstetling, verteerbaafheid on voederwaarde van 
voedermiddeisn, Centra! Vesvoedertweau, ROnderweg 8, 8219 pk Leiystad> ISBN 90- 
10 72830-13-7. 

In a particular embodiment, the animal feed composition of the invention contains at 
least one protein or protein source as defined above. It may also contain animal protein, such 
as Meat and Bone Meal, and/or fish Meal, typically in an amount of 0-25%. 

In still further particular embodiments, the animal feed composition of the invention 
15 contains 0-80% maize:; and/or 0-80% sorghum; and/or 0-70% wheat; and/or 0-70% Barley; 
and/or 0-30% oats; and/or 0-40% soybean meal; and/or G-25% fish meal; and/or 0-25% meat 
and hone meat; and/or 0-20% whey. 

Animal diets can e.g. be manufactured as mash feed (non pelleted) or pelleted feed. 
Typically, the milled feed-stuffs are mixed and sufficient amounts of essential vitamins and 
20 minerals are added according to the specifications for the species in question. Enzymes can 
he added as solid or liquid enzyme formulations. For example, a solid enzyme formulation is 
typically added before or during the mixing step; and a liquid mzyrm preparation is typically 
added after the pelleting step. The enzyme may also be Incorporated in a feed additive or 
premix, 

as The final enzyme concentration in the diet Is within the range of 0,01-200 mg enzyme 

protein per kg diet, for example in the range of 0.8-25 fng enzyme protein per kg animal diet. 

The protease should of course be applied In an effective amount, i.e. In an amount 
adequate tor Improving solubisaiion and/or Improving nutritional value of feed, It is at 
present contemplated that the enzyme Is administered in one or more of the following 

30 amounts (dosage ranges); 0,01-200; 0.01-100; 0.5-100; 1-50; 5-100; 10-100; 0.05-50; or 
0.10-10 ~ all these ranges being In mg protease enzyme protein per kg feed (ppm), 

for determining mg enzyme protein per kg feed, the protease Is purified from the feed 
composition, and the specific activity of the purified protease Is detem-sinsd using a relevant 
assay (see under protease aotwity, substrates, and assays). The protease activity of the feed 

35 composition as such is also determined using the same assay, and on the basis of these two 
determinations, the dosage in mg enzyme protein per kg feed is calculated. 



48 



The same principles apply for determining mg enzyme protein In feed additives. Of 
course, if a sample is available of the protease used for preparing the feed additive or the 
feed, the specific activity is determined torn tills sample (no need to purify the protease from 
the feed composition or the additive). 

s 

Detergent Compositions 

The protease of the invention may he added to and thus become a component of a 
detergent composition. 

The detergent composition of the invention may for example he formulated as a hand 
10 or machine laundry detergent composition including a laundry additive composition suitable 
for pre-troatment of stained fabrics and a rinse added fabric softener composition, or be 
formulated as a detergent composition for use in genera! household hard surface cleaning 
operations, or be formulated for hand or machine dishwashing operations. 

in a specific aspect, the invention provides a detergent additive comprising the 
16 protease of the invention. The detergent additive as well as the detergent composition may 
comprise one or more other enzymes such as another protease, such as alkaline proteases 
from Bacillus, a lipase, a cutinase, an amylase, a carbohydrase, a ceiluiase, a pacfinase, a 
mannanase, an anabinase, a gaiactanase. a xyiariase, an oxidase, e.g., a laccase, and/or a 
peroxidase. 

20 In genera! the properties of the chosen enzyme(s} should be compatible with the 

selected detergent, (Le. pH-optimum, compatibility with otter enzymatic and non-enzymatic 
ingredients, etc), and the enzyme{s) should be present in effective amounts. 

Suitable lipases include those of bacteria! or fungal origin. Chemically modified or 
protein engineered mutants are included. Examples of useful lipases include lipases from 

28 Hmm'cola (synonym Th&rmomyoes), e.g. from H lanuginosa. (T. ianuginosus) as described In 
BP 258088 and EP 305218 or from H< insolms as described in WO 96713580, a 
Psepdomonas lipase, eg, from R alcaiigenes or P, pseudoa/ca%enes (EP 218272), P. 
wpaoia {EP 331378), P, stu&efi (OB 1,372,034), P< 8umae®m, Pseudomonas sp, strain 
SO 70S (WO 95/08720 and WO 98/27002), R wisconsm&nsis (WO 95/12012), a Bacillus 

30 lipase, e.g, from B. suhWls. (Oariois ef ah (1993), Biochemlca at Biophysics Acta, 1 1 31 , 253- 
360), B. stearoihermopftilas (JP 84/744S02) or a prnnilm {WO 91/18422), Other examples 
are lipase variants such as those described in WO §2/08249, WO 94/01541, EP 407225, EP 
260106, WO 95/35381, WO 98/00292, WO 95/30744, WO 94/25578, WO 95/14783, WO 
95/2281 S, WO 07/04079 and WO 97/07202. Pmfermd commercially available lipase 

35 enzymes include Upolsse™ and Upoiase Urns?**' (Novozymes TVS}. Suitable amylases 
(alpha- and/or beta-) include those of bacterial or fungal origin. Chemically modified or 
protein engineered mutants are included. Amylases Include, for example, aipha-arnyiases 



obtained from Sacfe, e.g. a spaclai strain of BJie&aniformm, described in more detail in <3B 
1,296,830, Examples of useful amylases are the variants described la 'WO 94/02597, WO 
-84/18314, WO 05726397, WO 96/23873, WO 97/43424, WO 00/60060, and WO 01/66712, 
especially the variants with suhst&uiibns in one or more of the following positions; 16, 2-3, 
6 105, 106, 124, 128, 133, 164, 1S8, 181, 188, 190 ? 197. 202, 208, 209, 243, 264, 304, 305, 
391, 408, and 444, Commercially available amylases are jNatetlase™, Supramyi"*, 
Staniyme^' 5 , DurarnyP, TerrnamyP, FungamyP and BAN™ (Novozymes A/8), 
Rapldase^ald Purastar™(fmm s3eoancor International inc.). 

Suitable ceiiuiases Include those of bacterial or fungal origin, Chemically modified or 

10 protein engineered mutants are Included, Suitable ceiiuiases include ceiiuiases from the 
genera Bacillus, Ps&udomom% Hwrmoki, Fusarium, Thml&vfet, Acmmoaium, e.g, the fungal 
ceiiuiases produced from Humicola mml&m, MyGBihphihora ihenmphM and Fusamm 
oxysporum disclosed in US 4,435,307, US 5,648,263, US 5,691,178, US 5,776,757 and WO 
88/09259, Especially suitable ceiiuiases are the alkaline or neutral eeiiuiases having colour 

is cam- benefits. Examples of such eeiiuiases are ceiiuiases described in EP 0 495257;, EP 
581372:, WO 88/1 1 262. WO 96/20397, WO 88/08940. Other examples are celiulase variants 
such as those described in WO 04/07908, EP 0 531 315, US 5.457,046, US 5,688,503,: US 
5,763,254, WO 95/24471, WO 88/12307 and WO 99/01544. Commercially available 
ceiiuiases Include Cellu2yme™, and Carezyme™ (Novozymes MB), Clazinasa™, and 

20 Puradax HA™ (Geneneor International inc.), aid'KACW<8)™ (Kao Corporation}. 

Suitable peroxidases/oxidases include those of plant, bacterial or fungal origin. 
Chemically modified or protein engineered mutants are Included. Examples- of useful 
peroxidases Include peroxidases from Coprinm, e.g. from O oinersm, and variants thereof 
as those described in WO 98/24818, WO 95/10602, and WO 98/15257. Commercially 

25 available peroxidases Include Guardzyms™ {Novozymes}, 

The detergent enzyms($) may be included In a detergent composition by adding 
separate additives containing one or more enzymes, or by adding a combined additive 
comprising alh-of these enzymes. A detergent additive of the invention, he. a separate 
additive or a combined additive, can be formulated e.g. as a granulate, a liquid, a slurry, etc, 

3d Preferred detergent additive formulations are granulates, in particular non-dusting 
granulates, liquids, in particular stabilized liquids, or slurries. 

Non-dusting granulates may be produced, e.g., as disclosed in US 4,108,991 and 
4,661,452 and may optionally be coated by methods known in the art. Examples of waxy 
coating materials are polyethylene oxide) products ipblyaffsyteneglycol, PEG) with mean 

36 molar weights of 1000 to 20000; ethoxyiated nonyiphenofs having from 16 to 50 ethylene 
oxide units; ethoxyfated fatty alcohols in which the alcohol contains from 12 to 20 carbon 
atoms and in which there are IS to 80 ethylene oxide units; fatty alcohols; fatty acids; and 



mono- and 4h and triglycerides of fatty acids. Examples of film-forming coating materials 
suitable for application by fluid bed techniques are given In G8 1483591, liquid enzyme 
preparations may, for instance, be stabilized by adding a poiyoi such as propylene glycol, a 
sugar or sugar alcohol, lactic acid or boric add according to established methods. Protected 
5 enzymes may be prepared according to the method disclosed in EP 238218, 

The detergent composition of the invention may be in any convenient form, e.g., a 
bar, a tablet a powder, a granule, a paste or a: liquid, A liquid detergent may be aqueous, 
typically containing up to 70 % wafer and 9-30 % organic solvent, or non-aqueous. 

The detergent composition comprises one or more surfactants, which may be non- 
10 ionic including semi-polar and/or anionic and/or oationic and/or zwitierionlc. The surfactants 
are typically present at a level of from 0,1% to 80% by weight 

When included therein the detergent will usually contain from about 1 % to about 40% 
of an anionic surfactant such as linear aikylbenzenesuifonate, alpha-oleflnsnlronate, aikyl 
'sulfate (fatty alcohol sulfate}, alcohol ethoxysolfate, secondary aikanesuifonale, aipha-sulfo 
1 S tatty acid methyl ester, aikyl- or aikenyisuccinlo acid or soap. 

When included therein the detergent wiil usually contain from about 0.2% to about 
40% of a non-ionic surfactant such as alcohol ethoxyiate, nonylphenc! ethoxyiate, 
alkyipoSyglyooside, alkyldlmethylamineoxfcte, ethoxylated My acid monoethaholarnide, fatty 
acid monoethandamide, poiyhydroxy aikyl Mty acid amide, or N-acyl N«alkyi derivatives of 
28 glucosamine f glucamldes 5 '). 

The detergent may contain Q~0S % of a detergent builder or corapiexirig agent such 
as zeolite, diphosphate, triphosphate, phospnonate,. carbonate, citrate, nibilofhacetic acid, 
athySenediamshetetraacetic acid, diefhySeneiriaminepeniaacetie add, aikyi-- or alkenyisueclnic 
acid, soluble silicates or layered silicates (e.g. SKS-8 from Hoechst). 
25 The detergent may comprise one or more polymers. Examples are 

carboxymethylceliulose, polyvinylpyrrolidone}, poly (ethylene glycol), polyvinyl alcohol), 
po!y(vinylpyr!d5ne~N-oxide}, poly{vinylimklazole). polycarboxylales such as poiyacryiates, 
maieic/acryilc acid copolymers and lanryl metbaaylate/acryilG acid copolymers. 

The detergent may contain a bleaching system whfeh may comprise a HjAt source 
30 such as perborate or percarhonate which may be combined with a peracldTormlng bleach 
activator such as tefraacetylethylenediamlne ornonanoyloxyisenzenesulfonate. Alternatively, 
the bleaching system may comprise perpxyaeids of e.g. the amide, imide, or surface type. 

The enzyme(s) of the detergent composition of the invention may be stabilized using 
conventional stabilising agents, e.g., a polyo! such as propylene glycol or glycerol, a sugar er 
3$ sugar alcohol, lactic acid, boric acid, or a boric- acid derivative, e.g„ an aromatic borate ester, 
or a phenyl boronlo acid derivative sash as 4~fontiylphenyi boronic acid, and the composition 
may be formulated as described in e,g. WO 92/1W09 and WO 92/19708, 
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The detergent may also contain other conventional detergent Ingredients such as e.g. 
fabric conditioners including slays, foam boosters, suds suppressors, anti-corrosion agents, 
soil-suspending agents, anti-soli redapositien agents> dyes, bactericides, optica! brlghteners, 
hydrntropes, tarnish Inhibitors, or perfumes. 
5 It is at present contemplated that in the detergent compositions any enzyme. In 

particular the enzyme of the invention, may be added Irs an amount corresponding to 0.01- 
100 rng of enzyme protein per liter of wash ligour, preferably QMS mg of enzyme protein 
per liter of wash iiqoup in particular 0,1-1 mg of enzyme protein per liter of wash ilqour. 

The enzyme of the invention may additionally be Incorporated in the detergent 
10 formulations disclosed in WO 97/07202;- 

Deposit of Biological i^laferlal 

The following biological materials have been deposited under the terms of the 
Budapest Treaty with the DSktZ (Deutsche Sammlung von Mlkroorganismen unci 
is Zelikuituren GmbH, Mascheroder Weg lb, D-38124 Braunschweig, Germany), and given the 
following accession numbers: 

Deposit Accession Number Date of Deposit 

N&caMsppm alba DSM 1 S84T May 30, 2003: 

Nocardiopsis sp, DSM 16424 May 24, 2004 

20 These strains have been deposited under conditions that assure that access .to. the 

culture will be available during the pendency of this patent application to one determined by 
the Commissioner of Patents and Trademarks to be entitled thereto under 37 C.F.R, §1.14 
and 35 U.S.C. §122, The deposit reprints a substantially pure culture of the deposited 
strain. The deposit is available as required by foreign patent laws in countries wherein 

25 counterparts of the subject application., or its progeny are filed. However, It should be 
understood that the availability of a deposit does not constitute a license to practice the 
subject Invention in derogation of patent rights granted by governmental action, 

Nocardlopsis dassanviifei subsp. dassonvillel DSfd 4323S Is publicly available from 
DSMZ. It was also deposited at other depositary institutions as follows: ATCC 23219, IMRU 

30 1250, NCTC 10489. 

The invention described and claimed herein is not to be limited In scope by the 
specific embodiments herein disclosed:, since these embodiments are intended as 
Illustrations of several aspects of the invention. Any equivalent embodiments are intended to 
be within the scope of this invention. Indeed, various modifications of the invention In addition 

35 to those shown and described herein wl become apparent to those skied in the art from the 
foregoing description. Such modifications are also intended to fali within the scope of the 



appended claims. In the 'case of Conflict, the present disclosure indyding definitions will 
control. 

Various references are cited hetBln, the disclosures of which are Incorporated by 
reference in their entireties. 

EXAMPLES 



EXAMPLE 1: Cloning and expression of the protease from Moeardlapsfs d&ssmwWm 
&ub$p< das sonvv/fe/ BSkl 43235 

LB agar Described in Ausubal, F. M. at at (eds.) "Cunent protocols in Molecular 

Biology". John Wiley and Sons, 1895 

LB-PGagar LB agar supplemented wild 0,5% Glucose and 0..0S M potassium 
phosphate, pH 7.0 

15 PS-1 10% sucrose, 4% soybean flour, 1% Na s P0 4 -i2HA 0,5% CaCO Ss and 

0.01% piur/dmc add 

TE 10rnMTris-Ha ; pH7.4 
■I mM EDTA, pH 8.0 

TEL 50 mg/mi Lysozym In TE~buffer 

20 Thlocyanate SM guanldlum thiocyanaie 
100 mU EDTA 

0.8 % w/v Ndaurylsarccslne, sodium salt 

80 g thlocyanate, 20 mi 0.5 M EDTA. pH 8.0, 20 ml H„0 dissolves at 6SC. Cooi down to 

room temperature (RT) and add 0.8 g Ndaurylsarcoslne. Add H,G to 100 ml and 'filter It 

25 through a 0.2 p sterile filter. 

NH^Ac 7.8 U CHgCOONH, 

TER 1 ug/ml Rnase A in TE-buffer 

CIA Chioroform/isoamyl alcohol 24: 1 

SEC ID NO: 1 Is the QUA sequence encoding a preform of the protease from 
Ntmtdiq&ste dassomliiei subsp, mmmvWet DSM 43235, Nucleotides 400-1082 corre- 
sponds to the mature peptide encoding part, 

SEQ ID HO: 2 is the deduced amino acid sequence of SEQ ID NO; ! Amino acids 
35 -188 to -1 is the propeptide, and amino acids i to 188 the mature peptide. 
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ClomngafSEq jONO: 1 

The wild type was gtwm'isp- 3.$a^fe€$CH^harv^.ift-te following medium at 30*0: 

Trypiiease 20 g 

Yeast extract Sg 
6 FerrocNoride 8 rng 

Magnesiumsulfafe IS mg 
Distilled water ad 1000 m! 
pH was adjusted to 7.2 by addition of .sodium hydroxide 

Genomic ONA from Namrd(Qp$i$ dassonw#e/ sabsp, dmsonvfM DSU 43238 was 
isolated according to the following procedure; 

1 . Harvest 1 ,5 ml culture and resuspend in 100 pi TEL. Incubate at 37*C for 30 min, 

2. Add 500 pi thlocynate buffer and leave at room temperature for 10 min. 

3. Add 250 pi NH 4 Ae and leave at ice for 10 min. 

4. Add 500 pi CIA and mix. 

5. Transfer to a microcentrifuge and spin for 1 0 min, at full speed. 

6. Transfer supernatant to a new Eppendorf tube and add 0,54 volume cold isopropanol. Mix 
thoroughly, 

7. Spin and wash the DNA pellet with 70 % BOH. 
3> Rsssuspend the genomic ONA In 100 pi TER 

The genomic ONA was used as template for PGR amplification using below primers 
SEQ ID NOS, 3 and 4* The PGR fragment was Isolated on a Q J% agarose gel and digested 
with restriction enzymes Cia I and BamH I Prime?' 1565 (SEQ ID NO: 4) disrupts a BamH I 
25 Site by changing an arglnine codon from CGG to AGA and introduces a new Barflft I site 
downstream from the stop codoo. 
Edmers: 

1423: S'-GCT TTT AGT TCA TCG ATC GCA TCG GCT GCT CCG GGC COG GTC CGC 
CAG-3' (SEQ ID NO; 3) 

30 1 566: 6~ QCO GAT OCT ATT AGG TTC TGA TCC TGA GAG CGC AG-3' (SEQ ID NO: 4) 

The digested and purWi&d PGR fragment was ligatad to the Cia I and BamH I digested 
pSasmid pDG288NeoMCS»PramyQ/PforylH/cryiyAstab/Sav (United States Patent: 
5,955,310), 

The ligation mixture was used for transformation Into E. coll TOP 1 0P (Invitrogan BY, 
35 The Netherlands) and several colonies were selected for miniprep (QIAprep spin, QIAGEN 
GmbH, Germany), The purified plasmlds were checked for Insert before transformation into a 
stain of BaoiHus subiMis denvsd from S: mhiftis DM: 1S8S with dlsrdpted apf, npr and pel 
genes (Dldenehsen el al (1090), JL Bactersol, 172, 4315-4321). The disruption was 
performed essentially as descdbed in -BBctitm mbtBis and other Gram-Positive Bacteria," 
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American Society for Microbiology, p.818, sds, AJL Sorienshein, J.A. Hoch and: Richard 
Losick (1 893). Transformed cells were plated or? 1 % skim milk LS-PG agar plates, 
supplemented with 6 pglml cillofsrnphenicoj. The piatad cells were incubated over night at 
37*0 and protease containing colonies were idenfjfiad by a surrounding clearing zone, 
5 Protease positive colonies ware selected and tte coding sequence of fha expressed enzyme 
from the expression construct was confirmed fey DMA sequence analysis. 

Femientafiprt 

The Bsoitfus mhiiiss host ceil transformed as described above was fermented on a 
10 rotary shaking table (250 r.p.m.) in 500 mi baffcd Erienmeyer flasks containing 100 ml PS-1 
medium supplemented with 8 ,yg/m! chloramphenicol, at 37"C for 18 hours arid at 28°C 'for 
extra 4 days. 

EXAMPLE 2: Purification and characterization of the protease from Nocyirdiapsis 
i s dassonviitei sub$p< dassonvffioi DSM 43235 

Proleaseassaye 
1 ) pNA assay; 

ptik substrata ; Suc~AAFF-pMA (Sachem 1-1400}. 

Temperature : Room temperature (25*C) 
M Assay buffers :100mM sucelnlo add, lOOmfel HEPES, fOOmM CHES, lOOmlvi CABS, 

imU CaCia, ISOrriWi KCi, 0.01% Triton X-100 adjusted to pH-vaiues 2.0, 2.5, 3,0, 3,5, 4,0, 

5,0, m, 7.0, 8.0, 0,0, 10,0, 11.0, and 12.0 with NCi of NaOH, 

"20j*j. protease (diluted in 0,01% Triton X-100) is mixed with lOOul assay buffer. The 

assay Is started by adding 100pj pNA substrate (SOrog dissolved in 1,0ml OMSO and further 
26 diluted 45x with 0.01% Triton X-100). The increase In OQm- is monitored as a measure of the 

protease activity. 

2} Pfoiasym© AK assay: 

Substrate j Protazyme AK tasiet (oross-linked and dyed casein; from Megazyrne) 
30 Temperature : controlled (assay temperaturel. 

Assay buffers :100mM succinic acid, IdOmM HEPES, iOOmM CHES, 1G0mM CABS, 

1mM CaCI & ISOmM KCI, 0.01% T$m )^ 100 adjusted to pH-values 2.0, 2.5, 3.0, 3,5, 4,0, 

5.0, 6.0, 7.0, 8.0, 9.0, 10,0 and 1 1,0 with MCI or NaOR 

A Protazyrne AK tablet is suspended in 2.0mi 0.01% Triton X-100 by gentle stirring. 
35 S00i.il of this suspension and SOOji essay buffer are mixed in an Eppendorf tube and placed 

on ice. 20pJ protease sample (diluted in G.01% Triton X-100) is added. The assay Is initiated 

by transferring the Eppendorf tube to an Eppendori therimomixer,. which is set to the assay 
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temperature. The tube Is incubated for 15 minutes on the Eppendorf thermomlxsr at its- 
highest shaking rate (1400 rpm). The Incubation is stopped by transferring the tube back to 
the Ice bath. Then the tube is centrf? uged in an ieeeold centr&^e for a few minutes and 200pl 
supernatant Is transferred to a mlcrotiier plate, QDm is read as a measure of protease 
s activity, A buffer blind is included In the assay- (irssfead erf enzyme). 

The protease fermentation described in Example 1 was eenfrifuged (20000 x g, 20 
min) and the supernatant® were carefully decanted from the precipitates. The combined 

10 supsmatants were filtered through a Safe EKS plate in' order to remove the rest of the 
Bacillus host eels. The EKS filtrate was transferred to SfotM H&BO S: SmU succinic acid, 
1mM CstCk, pH7 on a 625 sephadex column and applied to a bacitracin silica column 
equilibrated in the same buffer. After washing the column extensively with the equilibration 
buffer, the protease was step-eluted with lOOmM H g B0 3i lOrnSVl succinic acid, 2mU CaCI 2 «- 

15 iy NaCL 25% Isopmpanol, pH?. The bacitracin efuate was transferred to SOmfvl H ;i 80 s , 
lOmfvl CHsCOOH, IvrM CaC\ if pH 4.5 and applied to a S sepharose HP column ecjufisbrated 
In the same buffer. After washing the columri -extensively with the equilibration buffer, the 
protease was eiuted with a linear NaCi gradient (0 -> 0.5M) in the same buffer. Fractions 
from the column ware analysed for protease activity (using the Profazyme AK assay at $7^0 

20 and pH 9} and active fractions were further analysed by SDS-PA<3&. Fractions, where only 
one band was seen op the coomassfe stained SDS--PA8E gel, were pooled as the purified 
preparation and was used for further characterization, 

pH^clbd k. 3 ..pH-sjabit^ai}gl^ 

25 The pHA assay was used for obtaining the pH-activIfy profile as well as the phh 

stability profile, For the pH-stabliity profile the protease was diluted lOx in the assay buffers 
and Incubated for 2 hours at 3.7°C. After incubation the protease samples were transferred to 
the same pH - pH 8, before assay for residual activity, by dilution In the pH 9 assay buffer. 
The Proiazyroe AK assay was used for obtaining the temperature-activity profile at pH S. The 

30 results are shown in Tables 1-3 heiow. 



56 



Table 1 : oH-aclivHy profile 



pM 


Protease derived from 
dassoiwHiei QBM 43235 


Protase derived from 
NomrSopsmsp, HRHL 18262 


2 


0.00 




3... 


0,00 


0.00 


4 

6 


0.03 


0.02 


0.11 


0.07 


£ 


0.21 


0.21 


7 


0,37. 


0.44 j 


8 


0.71 


0,*37 




0.97 


0.88 


10 


1,00 


1,00 


11 


0.94 


0.93 



Table 2; pH-stabiliiy profile 



| pH 1 Protease derived from 

j i subsp, tfrnsonvitfe; OBM 4323$ 


Protase derived from 
?¥ocsfc?^opsi ! s sp. UBRL 
18262 


2.0 


1.00 


&78 


1 

2.5 


0.95 


1.00 


1 3.0 


0.9? 


1,03 | 


1 3.5 


1,01 


_ 0 : 9§ . j 


L m 


0,98 


0.99 


1 §M 


0,97 


1.02 ! 


! 6.0 


0.98 


1.00 ! 


! 7.0 


0.96 


1.01 | 


f -— 

! 8.0 


0,99 


om 


! 9.0 


0,99 


0.99 | 


1 10.0 


0.96 


0.99 1 


1 11.0 


0.94 


0.86 ! 


L 12.0 


0.64 . 


......**......,.;; j 


j&O and after 2 
| hours at 
[5 !, C 


1,00 


i 
i 

1.00 i 
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Tem- 
perature 
.1151 


Protease derived from 
dBSS&twiU&l DSM 43235 


Profase derived from 
Nomrmpsissp, NRRL 

3$M 


IS 


0,08 


0.02 


25 


0,01 


0.02 


37 


0.03 


6.07 


50 


0.09 


0.20 


80 


0.18 


0.51 




0.63 




—a 


\m ': 







0.35 





DSC was used to -determine, temperature stability at pH 7.0 of th& proteases derived 
5 from Nocatdsopsis tiassonviijei mbsp. dmsonviiiei DSM 43235 and from NocarcMopste sp, 
NRRL 18202. Purified proteases were diaiysed over night at 4"C against 10 mM sodium 
phosphate., 50 mM sodium chloride, pH 7,0 and run on a VP-DSC instrument (Micro Cal) with 
a constant scan rale of irc/mM from 20 to 100 a C. Data-handling was perfefrned using: the 
MkxoCal Origin software. 
10 The resulting denaturation or melting temperatures, T^s, were: For the protease Of 

the invention derived from Nocattjfopsk dmsonviitei mbsp. dmsonviii&i for the 

protease derived from Nomrdhpm m NRRL 18262; 76.5*0. 



MM^Sta^isjics 

15 The protease was found to be inhibited by Phenyl Methyl Sulfonyi Fluoride, its 

relative molecular weight as determined by 8DS-PAGE was M s * 20kDa. and the ^terminal 
sequencer APIIGOLAWMOGRC, 



EXAMPLE 3: Performance of the Nocardhpsis das&mwltim subsp&cies dassonviUei 

20 DSM 43235 protease in a mdnogastne in mtro digestion model 

The performance of a purified preparation of the mature part of the protease having 
883 ID NO: 2 (prepared as described in Examples 1 and 2} was tested In an in vitro model 
simulating the digestion in monogasflic animals, in particular, the protease was tested for its 
ability to improve solubiilsation and digestion of mafeei-SSivl (mateeZ-soybsan meal) proteins. 

25 in the tables below, this protease Is designated K proiease of the invention. The in vitro 
system consisted of 15 flasks in which mafcteZ-Sfefel substrate was inliaily Incubated with 
HCi/pepsin - simulating gastne digestion - and subsequently with pancreailn - simulating 
intestinal digestion, 10 of the flasks were dosed with the protease at the start of the gastric 
phase whereas the remaining flasks served as blanks. At the end of the Intestinal incubation 
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phase samples of m vitro digssta were removed and analysed for solubilsed and digested 
protein. 



utline of, M,yv( rg.cl.lge,§tjon proQ&fara. 





pH 




Time 
course 


Simulated 


10 q mtoHSm substrate 
(6:4^41 m\ HCi (0.1G5M) 


3.0 




t-0min 


Mixing 


5 mi HCi (Q.105M)/ pepsin 
(3000 U/g substrate), 1 ml 
Please of the invention 


3.0 


40°C 


t"30 min 


Gastric digestion 


16 ml HzQ 


3.0 


4o , r 


t~ 1,6 hour 


Gastric digestion 


7 ml NaOH (0.39M) 


ei 


40*6 


t-I.S hours 


intestinal digestion 


5 mi NsHCOs ftM) / 
pancreatic (8 rng/g diet) 


6.8 


4Q«C 


t~2.0 hours 




f^msmnesnvu.^riCM 




40°C 


t-8.0 hours 





to 



Substrate; 
pH: 
HCi: 
pepsin: 
pancreatin: 



temperature: 40°C. 



4 g SBM, 8 g maize (prerolxed) 
3-0 stomach step/ 8.8-7,0 intestinal step 
0/105 M for 15 hours {i.e. 30 min HCksubstrate prsmixlng) 
3000 U /g diet for 1 hour 
8 rng/g diet for 4 hours 



is Mte 



20 



0.39 M HbQH 
0,105 M HCi 

0.105K4 HCI containing 8000 U pepsin per 5 mi 
1 MNaHCG;* containing 16 mg pancreatin per ml 
125; mU NaAobuffer, pH 6.0 



Enzyme protein deterrninations 

The amount of protease enzyme protein (in what follows, Enzyme Protein is 
abbreviated EP) Is calculated oh the oasis of tile hm values and the amino acid sequences 
25 (amino acid compositions) using the principles outlined in SXXGSI! & P,H. von Hippei, 
Analytical Biochemistry 182, 319-328, (1988).. 



E^rtmerttal procedure for ift v$«mpd# 

The experimental procedure was according to the above outilna. pH was measured at 
30 time 1 2.5, and 5,8 hours. Incubations ware terminated after 6 hours and samples of 30 mi 
ware removed and placed on Ice before centnfugatfon (10000 x g, 10 min, 4°C}, 
SupernsfariSs were removed and stored at -20*0, 
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wo zmun 1 1 zrn vcrmmimnmmi 

All samples were analysed for % degree of protein with the OPA method as well as 
content of soluhllised and digested protein using gel filtration. 

8 P.H determination by the QFA-method 

The Degree of Hydrolysis i(DH) of protein In different samples was determined using 
an semi-automated microliter plate based coiodmetdc method (NielsemPiA; Petersen 5 D<; 
Danvbmann.C, Improved method lor determining food protein degree of hydrolysis, J, Food 
Sei .2001, 86, 642-848), The OPA reagent was prepared as follows: 7,620 g dl-Na 

10 tetraborate decahydrafe and 200 mg sodsumdodecyl sulphate (SOS) were dissolved In ISO 
ml delonked water. The reagents were completely dissolved before continuing. 180 mg c~ 
phfhal-diaidehyde 97% (OPA) was dissolved In 4 ml sthanof. The OPA solution was 
transferred quantitatively to the above-mentioned solution by rinsing with delonized water, 
175 mg dithioihreifol 99% (D'TT) was added to the solution that was made up to 200 ml with 

15 deionized water, A serine standard (0.0016 meqv/i) was prepared by solubltising 50 mg 
serine (Merck, Germany) in S00 mi deionized waten 

The sample solution was prepared by diluting each sample to an absorhancs (280 
nm) of about 0,5, Generally, supsrnalants were diluted (100 xj using an automated Tecan 
dilution station (Mannedorf, Switzerland). All other spectrophotometer readings were 

20 performed at 340 nm using deionfaed water as the control. 25 pi of sample, standard and 
blind was dispensed into a microliter plate, The mleroditer plate was inserted Into an ISMS 
MF reader (babsystems, Finland} and 2G0yi of OPA reagent was automatically dispensed:. 
Plates were shaker? (2 mln; 700 rpm) before measuring absorbance. Finally, the DM was 
calculated. Eightfold determination of ail samples was carded out. 

25 

S§teipaiiioiuM 

The content of soluhliised protein In supsmatants from in vitro digested samples was 
estimated by quantifying crude protein (CP) using gel filtration HPLC, Bupernatanfs were 
thawed, filtered through 0,45 urn polycarbonate filters and diluted (1:50, v/v) with H s O. 

30 Diluted samples were ebfomatograpbed by HPLC using a Superdex Peptide PE (7.5 x 300 
mm) gel filtration column (Global}, The eiuenf used for Isocraflc elution was SO mU sodium 
phosphate buffer (pH 7,0) containing 150 rfiM H&Cl The total volume of elueot per run was 
28 mi and the flow rata was 0,4 mltein. Elution profiles were recorded at 214 nm and the 
total area under the profiles was Met^i^''Win^mSoh. To estimate protein content from 

35 Integrated areas, a calibration curve (R 2 «O.8903} was made from a dilution series of an in 
vsim digested reference maizeASSiy sample wih known total protein content. The protein 
determination in this reference sample was carded out using the KjeMabS method 

60 



(determination of % nitrogen; A.OA-C. {1984} Official Methods of Analysis 14th ed>, 
Washington DC). 

The content of digested protein was estimated by integrating the chromatograro area 
corresponding to peptides and amino acids: having a molecular mass of 1500 Dalfon or 

5 below (8avoi£ s L; Gaoihier r 8-P Dialysis Celi For The b-vltro Measurement Of Protein 
Digestibility. J, Food Set 1988, St, 494-498; Bahte^,L t - tfan,&M.J.M.; Boer f H.; 
Den,H*LA An in-vilco Method for Pmdictie-n of The Digestible Crude Protein Content in Pig 
Feeds. J, Sci, Food .Agr, 1990 s m,. 173~1?8; Soisen3.; Egguni t B.O. Critical Evaluation of In- 
vitro Methods for Estimating Digestibility in Simpie-Stomaoh Animals, Nutrition Research 

10 Reviews 1991, 4, 141-182). To determine the 1500 Dalton dividing line, the gel filtration: 
column was calibrated using cytochrome C (Seehringer, Germany), aprotinin, gastrin I, and 
substance P (Sigma Akfricb> USA), ae molecular mass standards. 

Results 

15 The results shown in Tables 4 and 5 below indicate that the protease increased the 

Degree of Hydrolysis (DH), as well as soluble and digestible protein significantly. 



T|ble4 

Degree of Hydrolysis (DH), absolute and relative values 



(dosage in mg EP/kg 



m 



\ Relative to bier* 



i n 



%0H 



5 | .28.84 



Blank ; 

! Protease of the invention I 
IftQCft. j S [ 28,21. 

Dtfef^nt'tettSrs w?thln tft? f&tmi &>w' nvkwis mgirirmi f M&mm& (1-wsy A&OVA, Timy-Km^m Usi, 
N0,05)< SO w Standard Devigtfon. %CV ~ Coe85efent of Variants* « (SO/mean valus) x 100% 



SD |%OH 
0.69 1 100,0 



0.35 



105.1 



t * 



!\2.0/ 



1 1.25 



f able 5 



Solubised and digested crude protein measured by AKTA HPLC. 



| Enzyme j 
1 (dosage In j 
|mg Wfkm 
\ feed) 1 n 


Of total pr otein 


Relative to blank 


%di& 
CP 


. 1 




SD 




cv% 




p icv% 


| Blank |5 


,M<1 




9o.i "F 


1.1 


100.0 1 * 




100.0 




I Protease of [ 
1 the invention 
I (SO) 1 5 


577 


b \ i.i 


03.2 |* 


1.4 


108.7 I b 


1.9 


1034 


: 1,L 


1(100) Is 


58.8 


0.8 


94.8 i u 


0.9 


108.9 I s 


1,3 




5 0.9 j 



25 Different ;c:.t:;rs.; wsihiri satires ostein SricES significant diWersnces ANOVA, Taksy-Krsfnar test, 

P<01>!>}. SD - Stan<feci Deviation. %CV » Cc&fSdant s>f Vartee© « (SD/wean valise) x 100% 
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EXAMPLE 4: Performance of the protease from Nocardio§isi$ dassonviiiei suhsp. 
dammv0M B$M 43235 In an aquaculture m vitm model 

The protease preparation as described in Example 3 was tested in an aquaeufiure in 
vitro model simulating the digestion In .©aWwater fish, The /# vitm system consisted of 15 
flasks fn which substrate was inifisiiy incubated wits HCI/pepsin - simulating gastric 
digestion - and subsequently with pancreatin »- simulating Intestinal digestion, 10 of the flasks 
ware dosed with the protease at the start of the gastric phase whereas the ^malnslng 5 flasks 
served as blanks. At the end of the intestinal incubation phase samples of in wire digests 
were removed and analysed for soluhilised and digested protein. 

Outline o f aqua in v itro digestion procedure- 



Components added | pH | Temperature 


Time 
course 


Simulated digestion j 
J3hase 


10 g extruded SBM wfo$ftm> \ 3.0 j WC 
62 ml HCi (0.155M)/pepsin j 
(4000 U/g substrate), 1 ml of 
the protease of the invention 


t~0 min 


Gastric digestion 


7fflLNaOH (1,1 M) j 6,8 1 1S*C 

"1ml. NaHC0 3 (livt} / pancreatin ! 6.8 115*0 
(8 nig I§ diet} i 1 


t~6 hours 
i~7 hours 


Jn^sJriaJ^i^sjOT 
Intestinal digestion 


Terminate incubation | 7.0 j 16*0 


t~24 

hours; 
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Conditions 
Substrate: 



HCi: 
Pepsin; 
Pancreatin; 
Temperature: 13*C 
Replicates; 5 



10 g extruded SBM 

3,0 stomach step/ (18-7.0 Intestinal step 
0,1 55 MforS hours 
4000 U/g diet for 6 hours 
8 mg/g diet for 17 hours 



SdotlsRS 
1.1 NaOH 

0.155 M HCI / pepsin {4000 U/g diet) 
23 1 M NaHCOs containing 18 mg panemaiin/mL 
125 mM NaAc-buffer, pH 6.0 



Experimental procedure for aoua in vitro model 

The experimental produce was according to the above outline. pH was measured at 
30 time 1 ( 5, 8 and 23 hours. Incubations were termSnafed after 24 hours and samples of 30 ml 
were removed and placed on ice before canbifugatlon (13000 x g, 10 min, 0 S C), 
Supemafaots were removed and stored at -20*0. 
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M supernatants were analysed using the OPA method (% degree of hydrolysis) and 
by AKTA HPLC to detemiine solybSfised and digested protein (see monogastrie example). 



§ Pre-ireafrneni of in ^fommrm^n\^y^''^^MP^:^iimm 

Before analysis on AKTA HPLC supernatants from the In v^ro system were 
pnetreated using solid-phase sample purification. This Was done to Improve the 
chromatography and thereby prevent unstable elusion profiles and baselines. The columns 
used for extraction were solid phase extraction columns {Chromabond EASY SPE Columns 
10 from Maoherey-Nagsi}. 2 ml mBQ water was etuted through the columns by use of a 
vacuum chamber (vacuum Q.1S x 100 kP$% Subsequently 3 ml in vitro sample was 
dispensed onto the column and etuted {vacuum 0.1 x 100 kPa), the fire! % ml of eiyted 
sample was thrown away and a clean tube was placed beneath the column, then the rest of 

lilt 
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E§su!M 

The results shown in Tables 6 and 7 below Indicate that the 
Increased Degree of hydrolysis and protein digestibility. 



so significantly 



20 Table 6: Decree of Hydrolysis (OH) 







Of total g>r 


ofein 


i Relative to BMk } 


\ im&BPikgi diet) 


u 


%DH \" 


1"SD" 


T%dh r~T%cv i 


! Blank ,1 5 


"2fM"\' s " 


T'qIz 




| Protease of the invention (§0) 




21J8 [ F 


'To.22 


T : ml p r T.oo ; 



Different letters within me same column Indicate sipciilicant differences {1-wsy ANQVA, Tufcsy-S- 
P<0.06), SD « Standard Deviation. %CV * Coeflfcferrt of Vansrsce ~ (SQ/rn»;sn vsfcs) :< 100% 



Table 7: Soiublhsed and digested crude protein 







Of total protein 


Relative to 


oisnk j 


{mq EP/&g 
diet) 


n 


%0F | 
dig J 


SD 


| %CP 
I sol 




%CP ; 
stitl 1 


% 1 %CP 

cv ! sni 
Ts'llodl) 


l ey \ 

I &/ : 


'Sienk 


5 


500 


2.2 


f'ssi 


t r """te 


100.0 j 55 




Protease of 


"5 ' 


32. 3 i 


1,1 


| 01.4 


j * | is 


ms \ - 


21 | 101.7 


1 **** 1 


th>s 

Invention 
(SO) 




i 
i 






| § 








7io6) 






0,4 


I 91.6 


I s j to 


107.0 ! ® 


"o77~TT6T.¥""" 





25 Qifrareni .tetters wsS-sin the saoie column indicate significant differences (1-w. 



^ ANOVA, M;key»Kfamer lasi, 



P<0 .05). SD = Standard Deviation. %C¥ ~ Coefficient of Variance ~ {SD/mean value) % 100% 



EXAMPLE 5: Construction of BmlHmmMMIs strains 12, 12 HV0 and 12 HV1 

Two "Tail" variants (having C^ormlnai e>tonslons) of the protease having the amino 
30 acici sequence of amino acids 1-188 of SEQ SD HO: 2 were also made. The tail variant 
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designated hereinafter 12 HVO had the following 8 amino acids extra in the C-terromus; 
QSBV'QSAP with the following DNA sequence e>dension inserted In front of the TAA 
stopeodon (between nucleotides 1059 and 1 060 of 8EO ID NO: 1): 
5'~GAATCGCAT<3TTCMTCCGCtC€^-3" (MQ ID MO; S) 
5 The tail variant designated 12 HV1 had the fdifowlng 4 amino acids extra in the 0- 

temilnus: OSAP, with the following DNA sequence extension inserted in front of the TAA 
stopcodon (between nucleotides 1 0S9 and 1060 of SEQ ID HO: 1): 
S'-CAATCGGCTCCT-S' (SEQ ID NO; 6) 

Three steins of Bacillus stiMtk were constructed: One harbouring the DMA construct 

10 encoding the protease having the amino add sequence of amino acids 1-188 of SEQ ID NO: 
2, denoted L2, and two harbouring the DNA constructs snood ing the tail variants 12 HV0, 
and L2 HVt respectively. These constructs were fused by PGR to the DMA coding for the 
signal peptide from SAVINASE™ Bacillus cl&usil (TakamiX; KobayasbiX; Kobaysshljvi; 
YamamoioM; Nakamura,S.; Aono,R,; HorikoshiHj Motecuiar cloning, nucieotide sequence 

15 and expression of the structural gene for alkaline protease from aikaiiphilic Bacillus sp, 221) 
Slosei, Biotechnoi. Blochem. 56:1455 (1902) and Integrated by homologous recombination 
on the Sacfe suhtlim host cell genome (described in Example 1), The genes are expressed 
under the control of a triple promoter system (as described in WO 00/43835), consisting of 
the promoters from BacSlus iiohenifonnis alpha-amyiase gene (amyL), Sadfe 

20 arnyloiiquefaciens afpha-amyiase gene (mfo®}, and the 8acf//ea tfturingkmsw otyMA 
promoter including stabilising sequence. The gene coding for Chloramphenicol acetyl- 
transferase was used as marker. {Described in eg. Dideriqhssn.S.; Pou!se?v3.8.; 
JoergensenAT,; A useful cloning vector for aacibssubtiiis, Piasraid 30:312 (1903)), 

Chloramphenicol resistance fransformants were -checked for protease activity on 1% 

25 skim milk LS-PG agar plates {supplemented with 8 pgfmt chloramphenicol;}. Seme protease 
positive colonies were further analysed by DNA sequencing of the insert for correct gene 
DNA sequence and a strain of each construct was selected, called strain B.sub0$ 12, 
WMte t-2 HVO, Rsu» 12 HV1 . 

30 EXAMPLE 8: Fermentation of the & suttilm straps L2, L2 HVO and L2 HVi 

The three 8, wMlis strains 12, 12 HVO and L2 HV1, were fermented on a rotary 
shaking table in 500 ml baffled Edenmeyer flasks containing 100 mi TV supplemented with 8 
mg/l chloramphenicol . 

Six Erienmeyer flasks for each of the three a. sUMis strains were fermented in 
35 parallel. Two of the six Erienmeyer flasks were Incubated at 37 S C {250 rpm), two at SOX 
(250 rpm), and the last two at 28% (250 rpm). 
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A. sample was taken from each shake flask at day 1, 2 and 3 and analyzed for 
pmteoiySc activity, 
table -i. Proteolytic activity at 3? a C 



[Strain 


ialatsve values at 


37*e 1 




Day 1 


Day 2 


Day 3 j 


[Li 


1.0 M.O 1 1.0 


\ L2--HV1 


1,4 


13 


12 j 




1.3 


11 


1.4 ] 










| Strain 


Relative 


values at 30°C ! 




Day 1 


Day 2 


Day 3 \ 




i n 

i -v» 


1.0 


1.0 I 


L2-HV1 


1,0 


1,2 


1.4 


! L2-HV0 


TT 


1.3 


1.3 J 



Table. 10, Protadyfic activity at 2§*C 



I Strain 


Relative values at 28*C 




Day 1 


Day 2 


Day 3 


1.2 


1.0 


1.0 


1.0 


i L2-HV1 


13 


11 


11 


I L2-HV0 


0>2 




1.1 \ 



As it can be seen from tabse 8-10 above the effect of the fails Increases the 
10 expression level for the S2A protease from Nocardiasis tfa$smvili&! sybsp. &assoiwillel 
08M 4323§ when expressed m 8, subftMs, An increase of up to 40% is observed: in this 
experiment, but overall an improvement is observed for both tali variants, L2 HV1 and 12 
HVo\ at all three temperatures tested. 

1 5 EXAMPLE 7; Protease 22 

A protease designated "Protease 22" was designed so as to comprise the following 
characteristic amino acids of fie mature part (amino adds 1188} of SEQ ID NO; 2; 10Y, 
38T, 82S, SSP, 99A, 100V, 114!, 118N, 120T. 122R, 125Q, 129Y, 130S, 131L 18SS, and 
171Y; wherein each position corresponds te a position of SEQ ID NO: 2. 
20 The maiura part of Protease 22 Is amino acids 1-198 of SEQ ID NO: 8. The DMA 

sequence corresponding to SEQ ID NO; 8 Is SEQ ID NO: 7, 

The D'NA sequence of SEQ ID NO: ? was constructed and introduced Into a Bacttius 
host for expression. The expressed protease was purified and characterized as an afpha-lylie 
protease (peptidase family S1 E and/or S2A). 
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The tgmperatufe-activity relationship of Protease 22 was measured at pHS, using the 
Protazyme AK assay of Example 2. For comparison purposes the protease from 
Nacardk>p&k sp, NRRL 1 8262 was included. The msufts are shown in Table 1 1 below. 

The danaturafen or malting temperature of Protease 22 was determined as 
s described In Example 2 to T m ~ S3v§°C. 



Table 11. Te mperature profiles 





Relative activity 


at pH 9 


Temperature f C) 


Protease 22 


Protease from 






MRRL 18282 


W 


U . vi i t> 


0.015 


25 


0.010 




37 


0.028 i 0.068 


60 


0.088 


0.199 


60 


0.138 


0,510 


70 


0,474 


1.000 


80 


1,000 


0,384 


90 


0*375 * 





From these results ft appears that Protease 22 has a higher melting tempemtbfe, and 
10 a higher temperature optimum at pH 9, both as compared to the Nocardiopm sp, NRRL 
18282 protease, vix, a melting temperature increase of 7<0°O, and a temperature optimum 
increase of around 1CPC, 

EXAMPLE 8: Protease L2a 
IS The mature part of protease L2a is amino acids 1-189 of SEQ ID NO: 10, SEO it? 

BO: 9 is the DMA sequence corresponding to SEQ 10 HO: 10, 

The DNA sequence of $EQ ID NO: 0 was constructed and introduced Into a BacHtus 

host for expression as described in Example 1, and purified as described In Example 2, 
Protease L2a Is an afpha-lytie protease (peptidase family S1E and/or S2A). 
20 The femperafure-actiVfly relationship of protease L2a was measured at pH9, using 

the Protazyme AK assay of Example 2. For comparison purposes the protease from 

Noeardfopsis sp, NRRL 18282 (Protease 10} was included. The results are shown in Table 

12 below. 

The denaturation or melting temperature of the L2a protease was determined as 
25 described in Example 2toTi«« 78.2-0, 
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Table 12: Temperafor e activity profile 



J emp8 I Mu ^5 


. — . 

L2a protease 


Protease 10 






0.02 




0.02 


0.02 


37 


O.OS 


0.07 


50 


0,13 


0.20 




U.41 


U.O f 


70 


0,79 


1.00 


80 


1.00 


0.39 


90 


0-28 





From these results ft appears that protease L2a has a higher melting temperature, 
and a higher iarnperature optimum at pH S, both as compared to the Hocwtliopm sp.. NRRL 
5 18262 protease, viz, a melting temperature Increase of 1.7*0, and a temperature optimum 
increase of around 1G°C. 

EXAMPLE S: Protease 8 

The mature pert of protease 8 Is amino acids 1-188 of SEQ ID NO: 12. SEQ ID NO: 
I D 1 1 1s the DNA sequence corresponding to SEQ ID NO: 12. 

The PNA sequence of SEQ ID NO; 1 i was constructed and introduced into a BeosHus 
host for expression as described In Example 1, and purified as described In Example 2. 

Protease 8 is an alpha-lyfie protease (peptidase family SI E and/or S2A). 

The temperature-activity relationship of Protease 8 was measured at :pH% using the 
15 Protazyme AK assay of Example 2. For comparison purposes the protease from 
NGcardiopsis sp. NRRL 18282 {Protease 10) was Included. The results are shown in Table 
13 below. 

The denaturation or melting temperature of Protease 8 was determined as described 
in. Example 2 to T m - 78.3*C. 

20 
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Table 13: Temosrature acOvsiy profsie 



Temnerature PC) 


Protease S 


Protease 10 

. — ■■ — - ■ 


IS 


0.02 


0.02 




28 


0,85 


0.02 

-< 


37 


0,10 


0.07 


SO 


0.2? 


0.20 


m 


0.58 


0.51 


70 


US 


1.00 


so 


0.49 


0,39 


90 







From these results it appears that Protease 8 has a higher melting temperature,' and' 
a higher temperature optimum at pH 9, both as compared to Protease 10, e.g. a melting 
5 temperature increase of 1 .S'-C. 

EXAMPLE 10; Performance of the L2a Protease In a (Vlonogastric in Vitro Digestion 
Model 

The performance of the purified L2a protease described in Example 8 was tested in 
10 an 7ft -vitro model simulating the digestion in mohogastrse animals, In eomparisog with the 
known protease derived from NdGartftopm sp. NRRL 18282 f Protease 10"). I rv particular, 
the protease was tested for its ability to improve soiublsatioh and digestion of maize^SBM 
{mafze/~soybean meal) proteins. The In vitro system consisted of 18 flasks In which maize/- 
SBM substrate was initially incubated with HCI/pepsin - simulating gastric digestion - and 
15 subsequently with paoereafin ~ simulating intestinal digestion. Eight of the flasks were dosed 
with the protease at the start of the gastric phase whereas the remaining ten flasks served as 
blanks. At the end of the Intestinal incubation phase samples of in vitro digssla were 
removed and analysed for soluhillsed and digested protein. 
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pH 




Time 
sours© 


digestion phase 




....... 






fixing 


■mmmr^ — 

(3008 U/g substrate), 1 ml protease (to 
Dfxstsin par kg of substrate) 


3.0 






GasiicliisloF"' 


18 ml H ? 0 


30 








7 mi M'aOB (0,39M> 


6 J 





mm 


diqesiicn 


5 mi HaHCOs / oancreailn {8 
JS8&disi) ...... 


8,8 




1*2.0 hours 


intestinal 

. . .v.^. w„.w.-w. 


Termirjateincubancf) _ 




...10!C 







Con ditions 

Substrate: 
5 pH: 
HQ: 
pepsin: 
panereatirs: 
temperature 
10 Replicates: 

Solutions 

0.39 M NaOH 
0.1 OS M HOI 

16 0,1.05 M HCI containing 6000 U pepsin per 5 rni 
1 :.M "N'aHCOis containing 18 rag panereatin per ml 
125 tfM IMaAe-bufter, pH 0,0 

iMyo^.PfP'tefo determinations 
20 The amount of protease enzyme protein (EP) is calculated on the basis of the 

values and the amino acid sequences (amine add compositions) using the principles 
outlined in S.Q.GIH & P.H. von Hippel, Analytical Biochemistry 182, 319-328, (1989). 

Experimental procedure for in vitro model 
25 The experimental procedure was according to tie above outline. pH was measured at 

time 1, 2,5, and 5,5 hours, incubations were terminated after 0 hours and samples of 30 mi 
were removed and placed on Ice before eentdfugation (10000 x g, 10 mln> 4°C), 
Supernatants were removed and stored at 20 ft C 

30 Analysts 

All samples were analysed for content of soiuhitised and digested protein using gel 
filtration. 



4 g SSM, 8 g make (premrxed) 

3.0 stomach step/ 8.8-7.0 snfeeflnai step 

0.105 M for 1,5 hours (i.e. 30 min HCi-substrafe premixing) 

3000 U ig diet for 1 hour 

8 mg/g diet for 4 hours 

40*0. 

n 



The content of solubilised protein In sunernataote from in vitro digested samples was 
estimated by quantifying crude protein (CP) using gel fixation BPLC, Supamafants were 
s thawed, filtered through 8,45 pre polycarbonate filters arid diluted (1:50, v/V) with HA 
Diluted samples were chromatographed ay HPLC using a Superdex Peptide PE (7,5 x 300 
mm} gel filtration column (Global). The eiuent used for isoeraise elution was SO mU sodium 
phosphate buffer (pH 7.0) containing ISO mU HaCi. The total volume of eiuertt per run was 
26 mi and the flow rate was 0.4 mi/mfe Eluifoh profiles were recorded at 214 nhi and the 

10 total area under the profiles was determined by integration. To estimate protein content from 
integrated areas, a calibration curve {R 2 *0.9993} was made from a dilution series of an in 
vitro digested reference mateeASBM sample with known total protein content The .protein 
determination in this reference sample was carried out using the Kjsidahl method 
(determination of % nitrogen; KO.KC. (1984) Official Methods of Analysis 14th ed., 

15 Washington DC). 

The content of digested protein was estimated by integrating the chromatograrri area 
corresponding to peptides and amino acids having a molecular mass of 1500 Daftob or 
below (SavoieX.; GaufhienS.F, Dialysis Ceii For The in-vitro Measurement Of Protein 
Digestibility. J. Food Set. 1088, 51, 494-498; Sabinszky.L; SfmOMXm Boer.R; 

20 Oen,H,L.A. An tn-vitro Method for Prediction of The Digestible Crude Protein Content in Pig 
Feeds. J. Set, Food Agr. 1990, 50, 173-178; Soisen.S.; Eggurn,8,G« Cdfical Evaluation of In- 
vitro Methods for Estimating Digestibility in Simple-Stomach Animals. Nutrition Research 
Reviews 1991, 4, 141-182). To determine the 1500 Daiton dividing line, the gel filtration 
column was calibrated using cytochrome C (Soehnnger, Germany), aproHnin, gastrin I, and 

25 substance P (Sigma Aidrich, USA), as molecular mass standards. 

The results shown in Table 14 below indicate that the l.2a protease, like Protease 10, 
significantly increased the level of soluble and digestible protein relative to the bianK. 
30 Furthermore, the L2a protease appears to at least numerically improve the level of digestible 
protein as compared to the known Protease 10, 
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Table 14; Sgiuhillseri and digested crude protein 



enzyme 


13 


; Kesao\ 


f © to osan* 














| %dige: 


stibte CP 


cv% 


j%sofufateCP 






Blank 


10 


1 100.0 


pi 


5.5 


1 100.0 


" 1 4 4 


L2a protease 


3 


1 118.1 


rr — " 


0J 


1 107,2 




■It I 


Protease 10 


5 


1 112.1 


fs ~ 


1.0 


f?10-2 


h 


}qM \ 


Different tetters w 


Hhsb 


he same 


column trsc 


icaie s; 


;ioifics;'!i differen 




(1-wav ANG 



test, P<0.G5}> SD « Standard Deviation. Ckseffseient of Variance * (SD/mesn vaius) x 100% 

5 EXAMPLE 1 1 ' fish Feeding Trial 

Rainbow trout (Oocorhyitem mykiss% ail female stock with an initial body weight of 
approximately 27.3 g, were fed a practical basal diet containing 49% SBM (Soy Bern Meal) 
and 12% FM (Fish Meal), without (Control), and with, addition of the protease of the invention 
Having amino adds 1-188 of SEQ ID NO: 2> In an amount of §0 mg protease enzyme protein 
10 / kg feed. Fish were fed ad libitum. The feeding trial was performed in 500 1 tanks, part of a 
semi-recirculating' unit with a water temperature of 1S*C #>• 1*0, with SO fish per tank and 4 
replicates per treatment. Survival, growth, and feed conversion were measured over an 85- 
days feeding period, 

the results are shown in Table 10 below, A significant Increase in bodyweigbt was 
i s obtained by addition of the protease of the invention to the diet, 



Table 15: fa vivo fish re sults 



f raatmersi \ Survival 

1 i % 


Initial Body 
Weight / g 


Fina! Body 
Weight / g 


Weight Gain 

iM 1 


rlantrir jioao+z-o^o 






277,7 +/- 5,3 j 


i Protease 1- ! 100.0+/- 0.0 
188 of SEQ j 
ID NO: 2 i 


27>3*MTG 


333.5+/- 4,7 


306.2 +/- 4.7 



EXAMPLE 12: Proteolytic Degradation of iVIajor Purified Soybean Proteins 

20 We have studied the ability of the: protease from Nomrdfopsis sp. NRRL 18262 and 

the protease of the invention having amino acids 1-188 of SEQ ID NO; 2 to degrade purified 
soy proteins. The purified soy proteins studied were three so-called anti-nutritional factors of 
soy protein: Soybean agglutinin (SBA), the soyhean Kunlte trypsin inhibitor, the soybean 
Bowman-BIrk inhibitor; and two of the major storage proteins from soy; Gfydnln and beta- 

25 eongiyoinin. 

These soy proteins were obtained as follows: BSA was purified from unhealed 
soybean meal Cprod. No. S-9833, Sigma) by affinity chromatography using N~aminocaproyl~ 
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beta-D-galaefopyranosyiamine Saphafose {Sigma} Fm0s®8 eiuflng from the column wet© 
analysed using SOS-PAGE (Tris-glyetno 4-20%), and fractions containing essentially pure 
protein were pooled. Soybean glyeinio and beta^englycintn were purified according to the 
procedures described by Fischer, M, eiai; Enzymatie Extectebilsty of Soybean Meal Proteins 
a and Carbohydrates: Heat and Humidity Effects* J.Agric,Food Chem. 2001, 49, 4483-4469, 
The' Bowman-Birlt inhibitor and the Kunlfe trypsin inhibitor are commercial products (Sigma 
T-9777 and Ruka 93618, respectively)* 

The purified soy proteins were incubated with tie two proteases for 4 hours at 3?°C 
and pH 6,5 (protease; soy protein a 1:10, based on Aaas). incubation buffer; 50 mU dimethyl 

10 glutaric add, 1 50 mM NaCf, 1 mU CaGI 2 , %01% Triton X-100, pH 6.5. 

The ability of the proteases to 'degrade each of these five proteins was estimated as 
the relative intensity decrease of the Intact soy protein band on SDS-PA6E (Tns-glyeine 4- 
20%) gals stained with Coomassie Brilliant Sine. In case of giyclnin and beta-conglycinin, the 
term "the Intact soy protein banc"' in fact refers to two and three bands, respectively, as these 

15 two proteins, when subjected to the reducing environment on the 8PS-PAGE gel, denature 
under the formation of two and three sub-units, respectively. 

The Intensity of the intact soy protein bands (before and after treatment with the two 
proteases) was determined by scanning the stained SOS gels, and the degradation of each 
of the five proteins by each of the two proteases determined as the intensify of the respective 

2D Intact soy protein hand after, relative to the intensity of the respective intact aoy protein band 
before the protease treatment. The percentage protein degradation is calculated- as follows;; 
100% - {{hand intensity after/ band Intensity before) x 100%), The results are shown in Table 
16 below. 

2§- 'Table 15. Percentage degradation of isolated soy proteins 



Protease 


SBA 


Bowman- 
BirK 

inhibitor 


Kunfe trypsin 
Inhibitor 


Giyclnin 


Beta- | 
cdngiycMn 


From Nocardiopsh zp< NRRL 

;. > C? V £. 


., 

too 


35 


94 


98 


95 


\ Protease having amino acids 
1-188 of SEQ ID NO; 2 


95 


81 


97 


100 


100 



EXAMPLE 13: Animal Feed and Animal Feed Additives 

An animal feed additive comprising the protease of the Invention having amino acids 
1-188 of SEQ ID NO; 2, in the form of a vitamins and miners! praroix, Is composed as shown 
30 in Table 17 below. The vitamins and the carotehoids are commercially available from DSM 
Nutritional Products. Mi amounts are ia g/kg. 
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Vitamin A 


ROVIMIXA500 


4.00 


Vitamin D3 


ROVifvtiX 03 500 


100 


Vitamin 6 


ROVitviiX & 50 Ads 


8,00 




ROVIMiX B2 80-SD 


1.0 




CAROPHYLL Yo 'OW 


10,0 




Choline chloride 50%, min. 


300.0 






60.0 


1 Zn Oxide 


12.0 




Fe Sulphate monohydraie 


20.-0 




Cu Oxide 


2.0 




Co Sulphate 


, 0-2 


Enzyme 


"Preisase havino amino acSte TlSS of SEQ 15 NO: 2 (enzyme protein) 


3.0 


j Wheat middlings 


570,8 



The Premix of labia 17 Is included in a trout feed with a composition as shown in 
S Table 18 below. The amount of each ingredient is indicated in % (w/w). The in-feed- 
concentration of the protease of the Invention is 51 mg protease enzyme protein per kg. 



Table 1-8: Trout feed composition 



Wheat 


11,20 


Soya 


14.00 


Fish meal 




Soya oil 


4.00 


Starch 


8.00 


Yeast 


4.00 


Choline chloride 75% 


jOlO 


Premix of Table 17 


1.70 



73 



wo zomnnm 



FCT 



Qrtgtoai (for S ! J8£5fSSiO?-i 5 

(This sheei is noi pan of and doss mst cobs* »s a shssS of 1% irSeffstM»s5 s^pitc38cr.) 



|(8dS«ag«r«f R«is8«g So Ospos-SSod 
5^S©ro«r<53«te«s(si or 0$Jw BSetegleai 
gates'!*} (PC? Ruis ISoss) 
0-1-1 ^fispargci Usaig 


l&hSY mod©] 
Version 3*50 (Build 0002,162) 


d>2 |lnW«a8o«al Awii«»tio« fib. 




6-3 lA$»$»jic&<?£@ or agent's fite r«fej«fs«s 
i 


10436 < 504 WO 



■s 

1-1 


[The ijsdtaaiions mads tretew relate to j 

i$he fepssiisd >-fi>CiOCf93f5:Sl«(S) Of 

father »i«tegi«sl mateftet rofsrosiS to i« 
jtsi» tfescriptiOR an; 


52 








iiSsss 




18 






1-3 

1-3-1 


iSefcrsfifsesStof! of a>ifscs!i 


pPSME DSMS-Beutsche Sammlxmg von 


Mikroor- 


1-3-2 


i Address steposissty inggtu&x 




0-38124 




1-3-3 


iDsfeafdspBsit 
iASiisssior! NumtiSf 




30 May 2003 (30.95,5 
DSHS 15847 


w 

>Q03) 






;A'Wit!o«a( Jmifcsatfetts 


jMoc&rdiopsis alba 

?r 




: ■. 




jOssigriatsd States ter Sfifhteri 
lit tiJieatisws at s ffisds 


■i 

: 


all d0slgjiat.ios5.is 






„ 
S-1 


ITlws indications below reiate to 
IfcSls deposited t»lcroor$an&m{$) or 
lofe&f bioltfgtea} material referred to !»: 
iSse descripSon o?s; 

mm 


S2 








pirss 








2-3 


sldsrsiificatfor: o? dsposit 
INsme of daaasifarj institeKor: 

jAddraas <rf$gpo5$ssry ifisStaiiiew 


igsnisK-ex;. vad SSslXku' 
jMasaiwsroder Wssg ib f 


I &wx&l wag von 
.turi2s» OisbH 

D-3S124 


Mikroor- 


2-3-3 


iDmoof daacssi! 




24 May 2004 (24, OS, J 


>0D4) 




2-3-4 


|Aeces8fc»5 Number- 




DSMS 16424 






2-4 


Ao'dst-soai taoleaSsirss 




Moaardiopsis «$* 






2«o 


j DtMjgr.ato<l Sfesto* for Whicb 
judications aro fdacte 




all dasigxsafcions 






















FOR 8 


ECBVfNG OFFICE USE OsMLY 






8~* 


[Tft!sfom> was rooetva*! wish ttj* 
ii?5$fif!-!fdk»5S!! ffppiVeatkJir. 
i&es or fsa) 




5-4-1 













74 



W0 2*!O4/m22O 



(This is noi pssi of &M #ss nut comi as a sfceei cf sbs totemsttanat appSsissut;;;} 



FOR (MTERf^ ATjONAL SPSEAU USE ONLY 



»-§ jthiss fwm was nssssi 
jiKtei-na&miai S«fos ! 

! 


ved fey the 1 


©3 


JUL 


. 0~oo4 Cos o'^aoo4) 


0-5-1 jAafhsjffessd eSfcssr 

! 1 






& ?T09 



wo nmnttm 



Ourref: 10436.S04-WO 

Additional Indications (Form PCT/RO/134) 



Statement regarding the "expert option' 5 

Deposited microorganisms: 

With rasped to the deposited microorganism set out on Form PCI/RO/134 we request 
the so-caiied expert option: 

Until the publication of the mention of grant of a European patent or, where applicable, 
for twenty years from the date of filing if the application has been refused, withdrawn or 
deemed withdrawn, a sample of toe deposited micro-organism is only to be provided to 
an independent expert nominated by the person requesting the sample (cf. Rule 28(4} 
EPC). And. as far as Australia is concerned, the expert option is likewise requested, 
reference being had to Regulation 3.25 of Australia Statutory Rules 1991 No 71, Also, for 
Canada we request that only an independent expert nominated by the Commissioner Is 
authorised to have access to a sample of the microorganism deposited. 

Bagsvssrd,. 21 June 2004 
Novozymas A/S 




Karen 0stergaard Pedsrsen, Patent Counsel 



CLAIMS 



1 An isolated polypeptide having protease activity, and having a meltifsg temperature 
(Tffls) of at least 78*0, as determined by DlffeFentsa! Scanning Calorimetry (DSC) in a 10 mM 
5 sodium phosphate, 50 irM sodium chloride buffer, pH 7.0. using a constant scan rate of 
I.S^C/min, wherein the poJ^epttde-fe^fe^e^- from 'the grmp consisting, of: 

(a) a polypeptide having an amino acid sequence which .has a degree of identity to amino 
acids 1 to 188 of SEQ ID NO: 2, 1 to 190 of SEQ ID NO: 8, 1 to 109 of SEQ ID NO: 10, 
and/or 1 to 188 of SEQ ID NO: 12 of at least 80%; 

10 (b) a polypeptide which is encoded by a nucleic add sequence which hybridizes under 
low stringency conditions with either of nucleotides 490-1062 of SEQ ID NO: ! 577-1104 of 
SEQ ID NO: 7, 580-1 152 of SEQ }0 NO: 9, and/or 602-1005 of SEQ ID NO: 1 1; and 
(o) a polypeptide which is encoded by a nucleic acid sequence which has a degree of 
identity to any one of nucleotides 433-1082 of SEQ 10 NO: 1, 577-1184 of SEQ ID NO: 7, 

15 586-1 152 of SEQ ID NO: 9, and/or 502-1065 of SEQ ID NO: 1 1 of at least 60%. 

2. An isolated nucleic add sequence comprising a nucleic acid sequence which 
encodes the polypeptide of claim 1 , 

20 3. An isolated nucleic acid sequence comprising a nucleic add sequence which 
encodes a poiypeptide having protease activity and a melting temperature {T«4 of at least 
78°C, as determined fey Differentia! Scanning Calorfmetry (DSC) in a 10 sodium 
phosphate, SO mM sodium chloride buffer, pH 7.0, using a constant scan rate of 1.5 *C/min, 
wherein the nucleic acid sequence 

26: (a) hybridizes under low stringency conditions with any one of nucleotides 400-1062 of 
SEO. ID NO: 1, 577-1164 of SEQ ID NO: 7, 080-1 152 of SEQ ID NO: 9, and/or 502-1065 of 
SEQ ID NO: 11; 

(b) has a degree of identity to any one of nucleotides 499-1082 of SEQ ID NO: 1577- 
1164 of SEO ID NO: 7, 588-1152 of SEQ ID NO; & and/or 502-1065 of SEQ ID NO: 1 1 of at 

30 least 60%; and/or : 

(c) encodes a poiypeptide which has a degree of Identity to amino acids 1 to 1 88 of SEO 
ID NO; 2, 1 to 198 of SEO ID NO; 8, 1 to 189 of SEO ID NO: 10, and/or 1 to 188 of SEO ID 
NO: 12 of at least 60%. 

35 4. A nucleic add construct comprising the nucleic acid sequence of any one of claims 2- 
3 operabiy linked to one or more control sequences that direct the production of the 
polypeptide In a suitable expression host 
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5 . A recombinant expression vector comprising ills nucleic acid construct of claim- 4, 

8, A recombinant host ceil comprising the nucleic acid construct of claim -4 or the vector 
s of claim S. 

7. A method for producing a polypeptide of claim 1, the method comprising: (a) 
cultivating a recombinant host cell of claim 6 to produce a supernatant comprising the 
polypeptide; and fh) recovering the polypeptide, 

10 

8. A transgenic plant, or plant part, capable of expressing the polypeptide of claim 1 , 

9. A transgenic, non-human animal, or products, or elements thereof, being capable of 
expressing the polypeptide of claim 1 . 

is 

10. A method for producing a polypeptide of claim 1 , the method comprising 
(a) cultivating my one of the following strains: 

(!) Nvcardt'Qpsis; dmsormiim subsp, dassonwfe/ DSM 43235, 
til) Nocwdmpm sp. DSM 16424, or 
20 {ill} Moc&rdiopsi's $\bQ DSM 1S647; and 

(b| recovering the polypeptide, 

11 Use of at least one polypeptide as defined In claim 1 (i) in animal feed: (it) In ths 
preparation df a composition for use in animal feed; p) for Improving the nutntidnal value of 
25 an animal feed; (lv) for increasing digestible and/or soluble protein in animal diets; (v) for 
increasing the degree of hydrolysis of proteins in animal diets; and/or (vl) for the treatment of 
proteins. 



12, An animal feed additive comprising at least one polypeptide as defined in claim 1; 
30 and 

(a) at least one fat-soluble vitamin, and/or 

(h) at least one water-soluble vitamin, and/or 

(c) at least one trace mineral. 



35 13, An animal feed composition having a crude protein content of SO to 800 g/kg and 
comprising at least one polypeptide as defined in claim 1 , or at least one feed additive of 
claim 12, 
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14. The feed composition of claim 1 3 which: Is a fssfvfeed, 

i 5. A composition comprising at least one polypeptide as defined in claim 1 , together with 
5 at least one other enzyme selected from amongst amylase, phytese, xytanase, galactanase. 
alpha-galaetosidase, protease, pnospnollpase, and/or beta-glucanase, 

16. Use of at least one polypeptide as defined In claim 1 in detergents, 

10 17. Nocarah@sis$p< DSM 18424. 

18. Nocardiopsfe albs DSM 15847, 
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10436, -SCNMra,ST25: 

sequence listing 



<1I0> mvozym& a/s 

<1>0> Proteases 

<130> 10436. 504-w*G 

<:MQ> 12 

<1?0> Patentm version 3.2 

uo> i 



<211> 1065 



<212> DNA 

<213> Nocardiopsis dassonvillei subsp. dassonvillei 8SH 4323-!> 
<220> 

<22I> CDS 

<222> (1) . . (1062) 

<220> 

<221> mat^peptide 
<222> (499) , , (1062) 

<400> 1 

get ctg qcc ccc gtc ccc cag acc ccc gf.e gee gae gac age gec 
Ala Pro Ala Pro Val pro Sin Thr Pro val Ala Asp Asp Ser Ala 
-16$ 160 -155 
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gee age atg ace gag gcg etc &&g cgC ,-gac etc. gac etc acc teg 90 
Ala ser "Met Thr Sly Ala leu tys Arq Asp Leu Asp teu Thr Ser 
-ISO --145 --X40 

gee qag gee qag gag ett etc ttq gcg -cag gaa gee gec ate gag 135 
Ala Glu Ala si u Glu t«u Leu Ser Ala Gin <*\u Ala Ala lie Glu 
-135 -130 -125 

acc. gae gee gag gee act gag gee gcq ggc gag gee tae ggc ggc ISO 
thr Asp Ala Glu Ala Thr <slu Ala Ala Sly Glu Ala Tyr Gly Gly 
-120 -1.15 -110 

tea ctg ttc gae acc gaq acc etc gaa etc acc gtg ctg gtc acc gac 228 
ser Leu phs Asp Thr Glu Thr Leu Slu Leu Thr val Leu val Thr Asp 
-105 -100 -OS 

qcc tec gec gtc gag gca gtc gaq gec ace qga occ cag qcc arc. gtc 276 
Ala Ser Ala val SI u Ala Val Glu Ala Thr Sly Ala Gl n Ala Thr val 
-90 -85 -80 -75 

gtc tee cac ggc acc gag ggc ctg acc qag gtc gtg gag gac etc aac 324 
Val ser His sty Thr Glu sty Leu Thr Glu val Val Glu Asp Leu ash 
-70 -65 -60 

QOC acc gac att ccc gaq age gtc etc gae tgg tae ccq gac gtg gag 372 
Giv Ala el u val Pro Glu ser val Leu Slv rrp Tyr Pro Asp vai Glu 
-55 -50 ' -45 

aqc gac acc gtc gtg qtc gag gtg ctg gag ggc tec gac gec gac gtc 420 
sir asp Thr val val val Glu val Lm Glu Gly ser asp Ala Asp val 
-40 -35 -30 

qcc qcc ctg etc qcc gac occ qgt: gtg qac tec tec teg gte egg gtg 468 
Ala Ala Leu Leu Ala Asp Ala Sly val Asp ser Ser ser val Arg val 
-25 20 -IS 

ag gag gec gag gag gec ccg cag gte tac gec gac ate ate ggc ggc 516 
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10436.S04-WO.ST2S 
61 u 61 u Ala Glu Glu Ala Pro Gin Val Tyr Ala Asp lis He sly sly 
-10 ~S -1 1 S 

eta gee tac tac atg gac ggc one tgc tec gtc age ttc gee qcq acc 
Leu Ala Tyr Tyr Hal <sty sty Arq Cys Ser Val Sly Phe Ala Ala Thr 
' 10 IS 20 



564 



aac aqc gec ggt caq ccc got ttc gtc ace gec ggc eac tgc ggc ace. 612 

Asn ser Ala sly sin Pro sty Phe val thr Ala sty His Cys sly Thr 
25 ' 30 3.5 

Qtc ggc acc qqc gtg acc ate gee aac ggc acc ggc acc ttc caq aac 660 

val Gly Thr sly val Thr lie Sly Asn Gly Thr sly thr Phe sin Asn 
40 45 SO 

tcq gtc ttc. ccc gqc aac gac qcc qcc ttc §tc cgc age acc tec aac 20S 

Ser Val Phe Pro sly Asn Asp Ala Ala Phe Val Arg sly Thr Ser Asn 
55 ' 60 65 ' 70 

tte acc ctq acc aac ctq qtc teg cge tac aac tee ggc qqc tac cag 758 

Phe Thr Leu Thr Asn Leu val ser Arg Tyr Asn ser sly sly rvr sin* 

75 SO ' 8S 

teg gtg acc ggt acc age cag gec ceg gee ggc ten gec gtg tgc cgc 804 

Ser val Thr sly Thr ser Sin Ala Pro Ala sly ser Ala val Cys Arg 
30 v 95 100 



tec ggc tec acc acc ggc tgg eae tgc ggc acc ate cag gec cgc aac 85 

Ser Sty Ser Thr Thr Sty rrp Hi's cys sty Thr Us Gin Ala Arc Ash 
7 105 110 115 

cag ace gtg cgc tac ceg cag ggc acc gtc tac tee etc acc cgc ace 90 

Si h Thr Val Arg Tvr Pro Sin sly Thr val Tyr Ser \..m Thr Arg Thr 

120 125 * ' ISO 



•v 



948 



996 



aac gtg tgc gee gag ccc gac gac tec ggc ggt teg ttc ate tee aqc 
Asn Val cys Ala Si u Fro sly Asp ser sly sly ser Phe lie Ser sty 
135 140 ' 145 1SG 

teg cag gec cag ggc gtc acc tec ggc aac tec ggc aac tgc tec gtc 
Ser sin Ala Sin sly val Thr ser slv Slv ser Sty Asn cys ser val 
155 160 16 S 

gqc qqc acg acc tac tac cag gag gtc acc. ceg atg ate aac tec tqq 1044 
Sty Sly Thr Thr Tyr Tyr Gin Glu val Thr Pro mi lie Asn Ser Tr» 
170 175 ISO 

get gtc agg ate egg acc taa 1065 
Sly Val Arq lie Arg Thr 

is! 

<210> 2 
<211> .354 
<212> PRT 

<2X3> Hocardiopsis dassorrvillei sulssp. dassoiwlllei osn 43235 
<400> 2 

Ala pro Ala Pro val Pro sin Thr pro val Ala Asp ass ser Ala 
-165 -ISO ~lfs 

Ala Ser mt Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr ser 
■■150 -145 -140 

Ala Glu Ala Glu Slu Leu Leu ser Ala Gin Glu Ala Ala lis Glu 
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10436. S04-WO.ST.25 
-135 -130 -125 

Thr Asp Ala 01 u Ala Thr STu Ala Ala Glv ■ GTu Ala Tvr <Sly <Sly 
-120 -US -13.0 

ser Leu Phe Asp Thr <3l« Thr Leu Glu tm Thr Vkl Leu Val Thr Asp 

-MS -loo m 

Ala ser Ala val Glu Ala val Glu Ala Thr sly Ala Gin Ala Thr val 
-90 -85 -SO -75 

val ser His Gly Thr Glu Glv Leu Thr slu val Val Glu Asp Leu Asn 
-70 ' -05 ~60 

Gly Ala Glu val Pro Glu Ser val Leu Gly Trp Tyr Pro Asp Val Slu 
-55 -50 -45 

ser Asp Thr val val val Glu val Leu sly sly ser Asp Ala Asp val 
-40 -35 -30 

Ala Ala Leu Leu Ala Asp Ala Glv val Asp ser ser ser Val Arq val 
-25 -20 ' -IS 

Glu Glu Ala Glu Glu Ala pro Gin val tyr Ala Asp lie lie Gly Sly 
•10 -5 -1 1 S 

Leu Ala Tyr Tyr mt Gly Gly Arg cys ser val Gly Phe Ala Ala Thr 
10 15 20 

Asn Ser Ala sly sin Pro Gly Phe val Thr Ala Gly His Cys Glv Thr 
25 30 35 

val Gly Thr Gly val Thr lie Gly Asn Gly Thr Gly Thr Phe Gin Asn 
40 45 50 

Ser Val Phe Pro Gly Asn Asp Ala Ala Phe Val Arq Gly Thr Ser Asn 
55 80 m ' 70 

Phe Thr Leu Thr Asn Leu val Ser Arg Tyr Asn Ser Gly Glv Tvr Gin 
75 80 85 

ser Val Thr Gly Thr ser Gin Ala Pro Ala Gly ser Ala val Cys Arg 
90 95 ' 100 

ser Gly ser Thr Thr Gly Trp His Cys Gly Thr lie Gin Ala Arq Asn 
105 110 " IIS 

Gin Thr Val Arg Tyr Pre Gin Gly THr Val Tyr ser Leu Thr Arq Thr 
120 ' 125 130 

Asn Val Cys Ala Glu pro Gly Asp ser Gly Gly ser Phe lie ser Gly 

Page 3 



wo nmnttm 
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10436, SS4-«0.,-ST25 
MS 



ISO 



5er Gin Ala Gin Sly val Thr Sep. sly Gly Ser Sly Asn cvs Ser Val 
155 180 * 165 



Sly Gly Thr Thr Tyr Tyr Gin Gly Val Thr Pro Met lie asr Ser Tn 
170 ' m 180 



Gly Val Arg lie Arg Thr 
18 5 



<210> 3 

<2tl> 48 

<212> DNA 

<213> Synthetic 



■<2.20> 

<223> mlsc„feature 
<Z%3> Primr 

<4m> 3 

gettttacjtt catcgatcgc atcggctgct ccggcccccg- teccccag 48 



<210> 4 
<22.1> 35 
<212> DNA 

Synthetic 



<220> 

<221> rsdscfeature 
<223> Primer 



<21.0> 5 

<21X> 24 

<2X2> DNA 

<213> Synthetic 



<2<20> 

<223> Extension 
<400> S 

caatcgcatg tteaatccgc tcca 24 



<210> 6 

<211> 12 

<2X2> DMA 

<213> synthetic 



<400> 4 

geggatccts ttaggttctg atcctgacac cccag 




ml sc„f eatu'fe 
Extension 



<400> 6 
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caatcggcte ct 12 

<211> 1164 
<212> DNA 

<213> syntheti c C'pmteasa 22*) 



<220> 



CDS 

(!) . < (1184) 



<220> 

<221> sig...:peptide 

<222> O-J - v (81) 

<220> 

<22 1> mi $c„.f eatu re 

<222> (82) . . (§76) 

<223> Propeptide 

<22Q> 

<22l> mat„.pept1 de 

<222> (577),, (11 €4) 



- / Art. " ' 

<400> i 

atg aaa aaa ccg ctg gga aaa att. gt.c pea age aca gca ctt ctt 45 

Mat cys Lys Pro leu <3ly tys lie val Ala ser Thr Ala Leu Leu 

•190 -18S -180 

att tea gtf gca ttt age tea tct att gca tea gca qct aca gga 90 
lie ser val Ala Phe Ser ser ser lie Ala sar Ala Ala Thr Gly 
••175 -170 -165 

gca tta ctg cag tct teg aca ccg gaa aca gat gca gtc tea atg 135 
Ala Leu Pro sin ser Pro Thr Pro 61 u Ala Ass Ala val Ser Met 
-ISO -155 -ISO 

caa gaa gca ctg caa sga gat ctt gat ctt aca tea qca gaa gca 180 
Sirs el u Ala Lay Gin Ar<i asd leu Asp Ley Thr Ser Ala Glu Ala 
-MS -140 -135 

9|& Sfa ett ctt get gca caa gat aca gca ttt gaa gtg gat gaa 225 
Sly 61 u tm Leu Ala Ala Gin Asp Thr Ala Phe Slu val asd Glu 
~130 -125 -120 ' 

Sga 9C8 gca gaa gca gca gga gat gca tat que gac tea att ttt 270 
Asa Ala Aia eJu Ala Ala sjv asp Ala Tvr <slv GTy ser val pHe 
-13-5 -110 -105 

gat aca gaa tea ett gaa ctt aca gtt Ctt gtt aca gat qca sea qca 318 
Asp rnr Glu ser leu Glu Leu Thr val t&d val Thr asd Ala Ala Ala 
-100 -9$ ~m 

gtt q&a gca gtt gaa gca aca gga gca gga aca gta ctt gtt tea tat 366 
Val Glu Ala val Glu Ala Thr sl y Al a Gly Thr val Leu val ser Tyr 
-85 -80 -75 

gga att gat ggc ctt gat gaa att gtt caa gaa ctg aat gea qcc eat 414 
Sly lie Asp sly Leu Asp Glu xle Val sin el a. tea Ash Ala Ala Asp 
-70 -Si -60 -55 

get gtt ccg ggc gtt gtt ggc tgg tat ccg gat gtt get qqa gat aca 462 
Ala Val Pro Gly Val Val Gly Trp Tyr Frs Asp val Ala Sly Asp Thr 
-50 ' " --4S " -40 

gtt gtc ctt gaa gtt ctt gaa gga tea ggc gca gat gtt tea ggc ctg 510 

Rage 5 



wo zomnnm 



10436. 504--WG , ST25 
val val im Glu val im si a. sly Ser Glv Ala Asp val ser slv Leu 
-35 -30 '*■ ' -25 

ctg gca gac gca gga qtc gac gca tea gca qtt gaa gtt aca aca tea S5S 
Leu Ala Asp Ala Sly val Asp Ala Ser Ala Val Si a val Thr Thr Ser 
-20 v 15 -10 

gat caa ccg gaa ctt tat aca gat att aft ggc ggc ctg qca tat tat 606 
Asp Gin Pro slu Ley Tyr Ala Asp tie He Sly Giy Lea Ala rvr Tyr 
~S -1 I S * 10 

atg gac ggc aga tqc age qtt ggc t tt gca gca aca aat gca tea ggc 
mt Sly Gsy Arg cys Ser val sly r>he Ala Ala Thr Ash Ala ser Sly 
IS 20 25 



gtt tea att ggc aat qqc aaa qqc qtt ttt qaa cga age att ttt ceo 
Val Ser lie Sly ash sly Lys «1y Val phe Glu Arg ser lie phe Pro 
45 50 55 



gaa ccg qqc gat tea ggc ggc tea tat att age qqc act caa qca caa 
Slu Pro my Asp ser Sly Giy Ser Tyr lie Ser gIv Thr Sin Ala Gin 
140 " 145 ISO 

ggc gtt. aca tea ggc ggc tea ggc aat tgc -agt get ggc ggc aca aca 
Giy Val Thr ser Giy sly ser civ ash cvs ser Ala sly Giy Thr Thr 
IS 5 160 " 165 170 



<210> 


8 


<Zll> 


388 


<212> 


PRT 


<213> 


Syn 


<400> 


8 



mt Lys tys Pro Leu Giy Lys Tie Val Ala Ser thr Ala Leu Lea 
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caa ccg ggc ttt gtt aca gca ggc cat tqc ggc aca gtt ggc aca cca 702 
Gin Pro Giy Phe val Thr Ala Giy Mis Cys Giy Thr val Giy Thr Pro 
30 35 40' 



"SO 



ggc aat gat tea gca ttt gtt aga csqe aca tea aat ttt aca ett aea 798 
Giy Asrs Asp ser Ala Phe val Arg Giy Thr Ser Asn Phe Thr Leu Thr 
60 65 70 

aat ctg gtt tea aga tat aat tea ggc qqc tat gca aea qtt aca age 848 
Ash Leu Val Ser Arg Tyr Asn ser sly Giy Tyr Ala Thr Val Ala Giy 
75 80 85 90 

cat aat caa gca ccg att qqc tea gca gtt tgc aga tea ggc tea aca 894 
His aso Glii Ala Pro He Giy Ser Ala val cvs Are ser Glv ser Thr 
95 100 ' im 

aea ggc tgg cat tgc ggc aca att caa gca aga aat caa aca gtt agg 942 
Thr Giy Trp His cys Siy Thr lis Gin Ala Arg ash Gin Thr val Arcs 
310 ITS " 120 

tat ccg caa ggc aea gtt tat agt ctg aca aga aea aca gtt tat aca 990 
Tyr Pro Gift Giy Thr val Tyr Ser Leu Thr Arg Thr Thr val cvs Ala 
12 S 130 ' 135 



1038 



1086 



tat tac caa gaa gtt aat ccg atg ctt agt tea tgg ggc ctt aca ctt 1134 
Tyr Tyr Gin Glu val ash Pro Met Leu ser Ser Trp Sly Leo Thr Leu 
175 ISO 185 

aga aea caa teg cat gtt caa tec qct eca 
Arg Thr Gin ser His val Gin Ser Ala Pro 
190 195 



1164 



I043S.504~WO.ST2S 
-190 -185 -180 

lie Ser val Ala Phe ser ser ser lie Ala Ser Ala Ala Thr sty 
--1.7S -170 -165 

Ala Leu pro Gin ser Pro thr Pro slu Ala Asp Ala Val ser- Met 

-155 -ISO 



Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr ser Ala slu Ala 
-145 -140 -135 

Glu Slu Leu leu Ala Ala Girt Asp Thr Ala Pha Slu val Asp Gly 

-125 -120 



Ala Ala Glu Ala Ala Sly Asp Ala Tyr sly 61 y Ser val phe 
-US ' -110 -105 

Asp Thr Glu Ser Leu Glu Leu Thr val Leu val thr Asp Ala Ala Ala 
-100 -95 -90 

val Glu Ala val Glu Ala Thr sly Ala Gly Thr val leu Val Sar Tyr 
-85 -80 ' -75 

Gly lie Asp Gly Leu Asp Glu tie val sin Glu Lm Asn Ala Ala Asp 
-70 -65 -60 -55 

Ala val Pro sly val val Sly Trp Tyr Pro Asp val Ala Gly Asp Thr 
-50 ^45 -40 

val vat Leu Glu Val Leu Glu Gly ser sly Ala Asp Val ser Gly Leu 
-35 -50 ' -25 

Leu Ala Asp Ala Gly val Asp Ala Ser Ala Val Glu Val Thr Thr ser 
-20 ' -IS -10 

Asp sin pro Glu Leu Tyr Ala asp lie lie Gly Sly Leu Ala Tyr Tyr 
-S -1 i 5 " 10 

Net Gly sly Arcs cys ser Val slv Phe Ala Ala Thr Asti Ala ser Sly 
" 15 20 as 

Gin Pro Gly Phe Val Thr Ala Gly His cys Gly Thr val slv Thr Pro 
30 35 - 40' 

val Ser lie Gly Asn Gly Cys Slv Val phe. slu Arg Ser lis Phe pro 
45 50' ' 55 

Gly Asn Asp Ser Ala Phe Val Arg Gly Thr Ssr Asn Phe Thr Leu Thr 
80 65 " 70 

Asn Leu Val Ser Arg Tyr Asn Ser Gly Gly Tyr Ala Thr Val Ala Gly 



wo zomnnm 



10436. , ST2 S 
75 SO 85 90 

His As« Sin Ala Pro lie Sly Ser Ala val cys Aro ser sly .ser- Thr 
95 100 105 

Tfir 61 v Trp His cys sly Thr He 'Sin Ala Arg Asn sin Thr val Arg 
110 " IIS 120 

lyr Pro sirs sly Thr val Tyr Ser Ley Thr Arg Thr Thr val cys Ala 
125 130 135 

Slu Pro Slv Asp ser Sly Sly Ser Tyr lie Ser sly Thr sin Ala Sin 
140 r ' US 150 

Sly val Thr ser sly Sly Ser Sly ask- Cys ser Ala sly sly Thr Thr 
155 160 185 17G 

Tyr Tyr Sin Slu val Asn Pro mt im Ser Ser Trp sly leu Thr Leu 
175 ISO 183 

Arg Thr Sin ser His val Sin ser Ala Pro 
190 195 

<210> 9 
<;>X1> 115$ 

<2X2> OH A 

<213> ftocariJippsis sp. DSM 16424 
<220> 

<221> sig__pepticle 

<222> (1) . < (§;o 

<220> 

<22i> COS 

<222> (15 , . C1XS2) 

<220> 

<2 2 i> maiupepti de 
<222> (586) . . (11525 

<40Q> 9 

atq aga ccc tec acc ate gec tec gee gtc qge aca gga qca etg 45 
mi Arg Pro ser 'thr lie Ala ser Ala val Sly Thr sly Ala im 
-19$ " -190 -185 

gec ttc ggt etg qca ctq tec atq fjee ccc. gga gee etc gcg gcg 90 
Ala Phe Sly Leu Ala Leu Ser Met Ala Pro sly Ala Leu Ala Ala 
-ISO -175 -170 

ccc g«c etc gtc cec cag acc ccc gtc sec qac gac a«c gee gec 
Pro Sly Pro Val Pro Sin Thr Pro Val Ala Asp Asp ser Ala Ala 
-165 -160 -155 

sac atq acc gaa geg etc aag eat qac etc aac etc tec teg gec ISO 
ssr mt Thr 61 u Ala leu Lys Arg Asp Leu Asrs Leu sar ssr Ala 
-150 -145 -MS 



135 



gaa ccc gae gag ctq etc teg geg cag gaa gec gcg ate gag acc 
Glu Ala Slu Slu Ley leu ser Ala sin- Slu Ala Ala lie slu Thr 
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age gec cag ccc ggt ttc gtg acq gcg ggc cac tgc ggc acc gtc 
Ssr Ala sly <37n Pro Gly phe val Thr Ala Sly His cys Sly Thr val 
25 ' 30 35 

ggc sec gee gtg acc ate ggc gac g<?c cgc age gtc ttc gag cgc teg 
Gly Thr Ala Val rhr ite ely asp civ Arg 61 v Val phe sly Arg ser 
40 45 ' 50' 55 



366 



10436. 504-WO.ST25 
-135 -130 -125 

gac gee gag gee gec gaa gec gcg gqa gag get tac ggc ggc tec 270 
Asp Ala slis Ala ,41a sly Ala Ala Gl y S l y Al a Tyr sty Civ Ser 
-120 -115 -110 

ctg ttc gac acc gaa acc etc gaa etc ace gtg ctg gtg acc gac acc 318 
'teu Phe Asp rhr shi Thr Leu sly Leu Thr vat Leu val Thr Asp Thr 
-105 -100 -95 -90 

acq qcc gtc gac geq ate gag qcc acc gga gec gaq gec acc gtg qtc 
Thr* Ala val Asp Ala Val Slu Ala Thr Gly Ala Glu Ala Thr val Val 
••8S -SO -75 

acc cac ggc ace gac ggc ctg gee gag gtc gtg gag gac etc aac age 414 
Thr His Sly Thr Asp Sly Leu Ala alii val val sly Asp teu Asn Ser 
»/Q -65 -60 

gec gac pec ccg gcg ggc gtc etc ggc tag tac ccc gac atg gag age 462 
Ala Asp Ala Pro Ala Sly val Leu Glv Trp Tyr Pro Asp Met Slu Ser 
-55 -SO * -45 

gac acc gtg qtg gtc gag gtg ctg gag ggc tec gac gec gac gtc gee 510 
aso- Thr Val Val val Glu Val Leu Ski Sly ser Asp Ala Asp val Ala 
-40 -35 -30 

gec ctg etc get gac gec age gtg sac acc tec gec gtc egg gtg gag $58 
Ala Leu Leu Ala asp Ala 61 y vat Asp Ala ser Ala val Arg vat llu 
-25 -20 -15 -10 

gag gcg gag gag gtc ccg cag gtc tac gec aac ate ate ggc ggc ctg 606 
Sly Ala Glu Glu val pro Glh val Tyr Ala Asn lie lie Gly Gty Leu 
•5 -1 1 S 

gec tac acc atg ggc gga cgc tgc tec gtc ggc ttc gcg gcg aee aac 654 
Ala Tyr Thr Met Gly sly A'rg Cys ser val sly Phe Ala Ala Thr Asn 
10 IS 20 



702 



*50 



qtc ttc ccc ggc aac gac gec gec ttc gtc cgc ggc acc tec aac ttc 798 

val Phe Pre Glv Asn Asp Ala Ala Phe val Are sty Thr ser Asn Phe 
60 6S 70 

acc ctg aee aac ctg gtc tec cgc tac aac tec ggc qgc cac cag gcq 846 

Thr teu Thr Asn Leu val ser Arg Tyr Asn Ser Gly Sly His sin Ala 
75 SO 85 

gtg acc ggc aee age cag gee ccg qce: ggc teg gee gtc toe cgc tec 894 

val Thr Gly Thr ser Gin Ala Pro Ala sly ser Ala val cys Are ser 
90' 95 100 

ggc tec acc acc ggc tag cac tgc ggc ace ate cag gee cgc. aac cag 942 

Gly ser rhr Thr Sly Try His Cys Gly Thr lie Glh Ala Arg Asn slh 
105 110 r 1X5 

acc gtg cgc tac ccg cag qec ace gtc aac qcg etc acc cqc acc aac 090 

Thr Val Arq Tyr Pro Sin Gly Thr Val Asn Ala Leu Thr Arq Thr Asn 

120 " ' 125 ' 130 135 

gtg tec gee gag ccc eat. gac tec gge ege teg ttc ate tec ggc teg 1038 

val cys Ala sly Pre Sty Asp Ser Gly Gly Ser Phe lie ser sly Set 
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lQ4Mi.5Q4-m.ST2S 
140 M5 150 

cag gcc cag gge ptc acc tec age gac tec gee aac tge tec ttc qqc 
sin Ala sin sly val Thr Ser Sly sly Ser sly Asrs cys ser pfte sly 
155 ' « MS 

gqc a eg acc tac tac cag gag gtc gcc ccg atg ate sac tec tgg ggc 1134 
Sty Thr Thr Tyr Tyr (Sirs Sly Val Ala Fro Met iT& Ass Ser Trp Slv 
170 17 S ISO 

git cgc ate cgc acc age tga 1155 
Vat Arq He Arg Thr Ser 
185 

<2W> 10 

<?.!!> 384 

<212> PRT 

<213> Nocardiosis sp< mm 1S424 

<400> 10 

Met Arg Pro Ser Thr lie Ala Ser Ala Val Sly Thr Slv Ala Leu 
-195 -190 -185 

Ala Phs sly Leu Ala Leu ser Met Ala Pro sly Ala Leu Ala Ala 
-180 -175 ~%m 

pro sly pro val f>r& Sin Thr Pro Val Ala Asp Asp ser Ala Ala 
-165 -160 --1S5 

ser wet Thr 61 u Ala Leu Lys Arq Asp Leu as« Leu ser Ser Ala 
-ISO -145 " -140 

Sly Ala Sly Slu Leu Leu ser Ala sin slu Ala Ala lie si a Thr 
-■135 -130 -125 

Asp Ala slu Ala Ala Slu Ala Ala sly Sly Ala Tyr Sly sly ser 

-115 HilO 



Leu phs Asp Thr sly Thr Leu slu Leu Thr val Leu val Thr Asp Thr 
-105 -100 -35 -90 

Thr Ala val Asp Ala val slu Ala rhr sly Ala slu Ala xhr val val 
-8S -80 -75 

Thr His sly Thr Asp sly Leu Ala slu val val slu Asp Leu As ft Ser 
-TO -65 -60 

Ala Asp Ala Pro Ala sly Val Leu sly Trp Tyr Pro Asp Met Slu Ser 
■55 -SO -45 

Asp Thr val val val slu val Leu slu sly ser Asp Ala Asp val Ala 
-40 -3$ -30 

Ala Leu Leu Ala Asp Ala sly val Asp Ala ser Ala val Arq val Slu 
~23 -20 -IS ■ -10 
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10436.504 -WO, ST2S 



filu Ala 61 u sTu val Pro Gin val Tyr Ala Asn lie lie Gly 61 y Leu 
-5 -1 1 5 



Ala Tvr Thr Met Glv Gly Arc* Cys Ser val sly Pne Ala Ala thr Asa 
' 3-0 IS 20 



Ser Ala Glv Gin Pro 61 v Phe val Thr Ala Gly His cys Glv Thr val 
25 " 30 3% 



Glv Thr Ala val Thr Us sly asp Gly Arg Sly val Phe slu Arg ser 
40' 45 50 SS 



val Phe pro gly Asn A$p Ala Ala Phe val Arg sly Thr ser Asn phs 
60 m ?0 



Thr Ley Thr ash Leu val ssr Are Tvr Asn Ser Gly Gly His Gift Ala 
75 " 80 83 



val Thr Glv Thr Ser Gin Ala Pro Ala Glv ser Ala val cys Arg ser 
90 95 100 



Gly $gr Thr Thr Glv Trp His Cvs Glv Thr lie Gin Asa Arq Asn Gin 
105 110 " 115 



Thr Val Arg. Tyr Pro 61 n Gly Thr val Asn Ala Lay Thr Arg Thr Asn 
120 125 130 135 



Val Cys Ala Glu Pro Gly Asp Ser Glv sly ser Phe xla ser Glv ser 
140 145 ISO 



Sin Ala Gin Gly Val Thr Ser Sly sly Ser Gly Asn cvs ser phe sly 
IS 5 160 165 



Glv Thr Thr Tyr Tvr Gin slu val Ala Pro Met He Asn ser Trp Gly 
170 17 S 180 



Val Arg lie Arg Thr Ser 



<213> Nocardiasis alba OSM 15647 



<Z2Q> 

<22t> COS 

<222> (l}..(10g5) 

<tzo> 

<221> mat„pept.ids 

<222> (S02) , . (1085) 




1068 
DMA 
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10436* 504-^.--ST2'5 

<220> 

•<221> siq eeptide 
<222> (552). .(1065) 

<400> II 

gcg acc ggc cue etc ccc cag tec ccc acc ccg gat gaa gtc gag 45 
Ala Thr Glv Pro uu Pro sin ser Pro Thr Pro Asp &u Ala Glu 
-16$ -160 -155 

gec acc acc atg gtc gag gee etc caq cgc gac etc ggc ctg tec 90 
Ala Thr Thr Met Val Glu Ala Leu Gin Arg Asp Leu Gly Leu Ser 
-ISO -145 



ccc tct cag acc gac gag etc etc sag <icg csg gec gag tec ttc 13S 

Pro Ser Gin Ala Asp Glu Leu Leo Glu Ala , Gin Ala slu ser Phe 
-13 S -130 -125 

qm ate qac gag gec gec acc acq gee gea gec gac tec tac ggc ISO 

G'I'fi: lie Asd Glu Ala Ala Thr Ala Ala Ala Ala Asp ser Tyr sly 
-120 -IIS -110 

age tec ate ttc gac acc qac aqc etc ace ctg acc gtc ctg gtc acc 228 

sly ser lie PNe Asp Thr Asp ser Leu Thr Leu Thr val Leu val Thr 
-ICS -100 ~S5 



gac gec tec gec gtc gag gcg gtc gag gec gec ggc gec gag gee aag 
Asp Ala Ser Ala val 61 u Ala Val Glu Ala Ala Glv Ala Glu Ala Lys 
-90 -65 -SO 

gtg gtc teg cac ggc atg gag ggc ctg gag gag ate gtc gec gac ctg 
Val val Ser His sty ml Glu Glv Leu Glu Glu He val Ala Asp Leu 
-75 -70 -65 -60 

aac gcg gec gac get cag ccc ggc gtc gtg ggc tgg tac ccc gac ate 
•Ash. Ala Ala Asp Ala Glh Pro Gly Val Val Sly Trp Tyr Pro Asp lie 
•55 -50 " ' -45 

cac tee gac acg gtc gtc etc qag gtc etc <?aq qqc tec got gee gac 
His Ser Asp Thr Val val Leu Glu val leu Glu sly Ser Glv Ala Asp 
--40 -35 -30 



got etc qcc tac acc atg gut ggg cac tqc teg gtc qqc ttc gcg acc 
Gly Leu Ala Tyr Thr Met sty sly Afo Cys ser Val 61 y Phe Ala Ala 
10 " 15 20 



226 



372 



420 



gtg gac tec ctg etc gec gac gec got gtg gac acc qcc gac gtc aag 468 

Val Asp Ser Leu Leu Ala Asp Ala Glv Val Asp Thr Ala Asp val Lys 
-25 -20 -15 

gtg gag sac see acc gao cag ccc qag ctg tac qcc gac ate ate ggc 518 

val Glu Ser Thr Thr Glu Gin pro Glu Leu Tyr Ala Asp He He Gly 

-10 -5 -1 1 5 



56' 



acc aac gec tec ggc cag ccc gsq ttc gtc acc qcc ggc cac tgc ogc 612 

Thr Asn Ala Ser Gly Gin pro G'1v Phe Val Thr Ala Gly His cys Glv 

25 " 30 35 

acc gtc qgc acc ccg gtc age ate ggc aac ggc cag ggc gtc ttc qac 660 

Thr val Gly Thr Pro val sSr He Glv Asn ity Gin Glv Val Phe Glu 
40 45 SO 

eqt tee gtc ttc ccc ggc aac gac tec gec ttc gtc cgc ggc acc teg 708 

Arg ser Val Phe Pro Gly Asn Asp ser Ala Phe val Arc Glv Thr Ser 
55 60 65 

aac ttc acc ctg acc aac ctg gtc age cgc tac aac acc gut ggt tac 756 

Asn Phe Thr Leu Thr Asn Lou Val ser Arg Tyr Asn Thr Gly Gly Tyr 

70 75 SO 85 
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3.8436 504 -WO. ST 2 % 

gcg acc ate tec ggc tec teg cag gcg $m. ate ggc teg caq ate tgc 804 
Ala Thr val Ser civ Ser ser Gin Ala Ala lie sly. sen sin xle cys 
90" 95 100 

cat tec ggc tec acc ace ggc tgg car. tgc gqe acc gtc cag gec cgc 852 
Arg Ser sly Ser The Thr Sly'Trp His cys sl y Th r val Gin Ala Arg 
'105 110 115 



gqc caq acq gtq age tac ccc cag ggc acc gtg eag aac ctq ace cgc 
sly Gin Thr val ser Tyr Pro Gin sty Thr Val' Sin Asn. Leu Thr Arg 
120 125 . 130 



900 



acc aac gtc tac acc gag ccc ggt qae tec gee qqc tec ttc ate t'cc 948 
Thr Asn val Cys Ala Glu pro sly Asp Ser sly sty ser phe lie ser 
135 140 MS 

ggc aqc cag gec cag gqc ate acc tec qqt ggc tec ggc aac toe tee 996 
«1y Ser Gin Ala Sirs sly val Thr ser Gly Gly ser sty Asn cvs ser 
150 155 160 ' 165 

ttc gqt ggc ace ace tac tac cag gag gtc aac ceg atg ctq age age 1044 
Phe Gly Sly Thr Thr Tyr Tyr Sin Slu Val Ash Pro Met Leu Ser ser 
170 ' 175 ISO 

tgq gut ctg acc ctq cqc ace tga 1068 
Try) Gly im Thr Leu Arg Thr 
185 

<210> 12 
<211> 355 
<2t2> PRT 

Npcardippsis alba psm lf64:7 

<4QG> 12 

Ala Thr sly Pre Lm Pro Gin ser Pro Thr Pre Aso Glu Ala Slu 
-X6S -160 " -135 

Ala Thr Thr Met val Slu Ala Leu sin Arg Asp Oeu Sly ley Ser 
•150 -145 " -140 

Pro ser sin Ala Asp slu leu Leu Glu Ala Glh Ala Glu Ser phe 
-135 -130 -12S 

Glu lie Asp Glu Ala Ala Thr Ala Ala Ala Ala Asp Ser Tyr Glv 
-120 -IIS -110 

Gly ser lie Phe Asp Thr Asn ser Leu Thr Leu Thr val Leu Val Thr 
-10S -100 -95 

Asp Ala ser Ala. val slu Ala Val Slu Ala Ala Sly Ala Slu Ala lys 
•90 -85 -80 

Val val Ser His sly Met Glu Glv Leo Glu slu Tie val Ala Asp Ley 
-75 -70 -65: ' -60 

Asn Ala Ala Asp Ala Gin Pro sly val val Sly Trp Tyr Pro Asp xle 
-55 -SO -45 
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104 36 .504 -WO. ST25 

His ser Asp Thr v&l val Leu <$lu Val Leu Si u Sly Ser -sly Ala Asp 
-40 -35 -30 



val Asp ser Leu Leu Ala Asp Ala Gly Val Asp Thr Ala Asp val tys 
--25 -20 -IS 



val Gl'u Ser Thr Thr &"lu Gin Pro Glu Leu Tyr Ala Asp lie He Gly 
•10 -5 -11 5 



sly teu Ala Tyr Thr Met sly Gly Arcs cys Ser val -sly Phe Ala Ala 
10 15 20 



Thr ash Ala ser sly sin Pro sly Phe val Thr Ala sly His cys sly 
25 30 35 



Thr val Gly Thr Pro val Ser Us Gly Asn Sly Gin Gly val PH& Glu 
40 45 50 



Arg Ser val pre pro Gly Ash ass s&r Ala phe val Arg Gly Thr ser 
55 ■ 60 6S 



Asn Phe Thr Leu Thr Asn Lew val ser Arg Tyr Asn Thr Gly Gly Tyr 
70 75 ' 80 85 



Ala Thr Val Ser Sly Ser Ser Sin Ala Ala lis Gly ser Gin lie cys 
90 9$ 100 



Ar.g Ser Sly Ser Thr Thr Gly Trp His Cys Gly Thr val Gin Ala Ar<j 
105 110 115 



Gly Gin Thr Val ser Tyr Pro Gin Glv Thr v&l Gin Asn Leu Thr Arcs 
120 12$ ' 130 



Thr Asn val Cys Ala slu pro Gly asp $ar Gly Gly Ser Phe lie ser 
135 140 ' 145 



Sly Ser Gin Ala Gin Gly Val Thr ser Gly Gly Ser sly Asn cys ser 



phe Sly Gly Thr Thr Tyr Tyr Gin Slu Val AsP Pro Met Leu ser ser 
170 175 180 



Trp Gly Leu Thr leu Arg Thr 
IS 5 



ISO 



155 




165 
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